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Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 


charge to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 


To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 
author’s lines, must not exceed 70 lines av- 
eraging 42 characters and spaces in length. 
Set the typewriter margins for short lines of 
42 characters, which are 3.5 inches long in 
elite typing, and 4.2 inches long in pica. 

The manuscript of the Brief Report must 
be double spaced throughout. Except for its 
short lines, it follows the standard style (1). 
Headings, tables, and references are avoided 
or, if essential, must be counted in the 70 
lines. Each Brief Report must be accom- 
panied by a footnote in the style below, 
which is typed on a separate sheet and not 
counted in the 70-line quota: * 


1An extended report of this study may be ob- 
tained without charge from John Doe, 300 Market 
St., Prospect 6, Mass. (giving the author’s full name 
and address), or for a fee from the American Docu- 
mentation Institute. Order Document No. from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress, Washington 25, 
D. C., remitting in advance $—— for microfilm or 
$—— for photocopies. Make checks payable to Chief, 
Photoduplication Service, Library of Congress. 





Extended report. The full report is pre- 
pared in the style specified by the Publica- 
tion Manual (1), except that it may be typed 
with single spacing for economy in photo- 
duplication by the ADI. 
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1. American Psychological Association. Council of 
Editors. Publication manual of the American 
Psychological Association. Psychol. Bull., 1952, 
49, 389-449. 
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Psychotherapeutics: Success or Failure 


G. R. Pascal and Melvin Zax’ 


University of Tennessee 


Whether or not behavioral change can take 
place concomitant with procedures instituted 
in the psychological clinic is a matter of be- 
lief. Belief is a subjective matter and the na- 
ture of the evidence required to achieve it 
seems to vary from person to person. Thus, 
aborigines have been found who do not be- 
lieve that a relationship exists between sexual 
behavior and pregnancy in the human fe- 
male. No amount of statistical significance 
can convince them of this relationship. The 
facts do not fit into their theoretical frame- 
work. In our culture, there are individuals 
who believe that a person behaves the way 
he does because “he is possessed of the devil,” 
or because, for some supernatural reason, he 
is “meant to be that way.” Persons with such 
a frame of reference will not be convinced by 
the nature of available evidence bearing on 
behavioral change in clinical subjects. Other 
more sophisticated persons reject evidence on 
statistical grounds, implying that no one has 
“proved” that behavioral change in clinic sub- 
jects is a result of clinical procedures. They 
are, of course, correct. No one has “proved” 
it in that sense. The best available published 
evidence bearing on this matter is that re- 
cently put forth by Rogers and Dymond (4) 
in which behavioral change as a criterion vari- 
able is largely neglected. Satisfactory evi- 
dence, on statistical grounds, does not now 
exist. Without an established criterion, with- 
out the possibility of efficient controls, it is 
improbable that a rigorous scientific experi- 
ment can be conducted in this area at this 
time. 


1The authors wish to express their gratitude to 
Drs. K. R. Newton, C. N. Sipprelle, and C. H. Swen- 
sen, members of the Staff of the Psychological Serv- 
ice Center, who assisted in the preparation of this 
paper. 


Evidence, however, does exist. Belief in it 
will depend on the frame of reference of the 
individual. Certain biases can be singled out 
which seem to be related to the acceptance 
of the evidence available. One is a conviction 
that the human organism learns. This bias is 
one to which all of us will give lip service; 
lip service because, although we believe in the 
learning of reading, writing, and arithmetic, 
we do not, many of us, believe that other 
habits such as attitudes can be learned. An- 
other one seems to be the conviction that hu- 
man learning can be controlled, that is, that 
conditions can be set up which will foster or 
hinder the learning of specific pieces of be- 
havior. A third might be the belief that the 
human organism has the capacity to react 
with what it has learned in one situation to 
another similar situation. With these beliefs, 
one is in a fair way to accept the evidence 
bearing on behavioral change in clinical sub- 
jects as a result of or concomitant with cer- 
tain procedures in the clinical laboratory. The 
available evidence runs somewhat as follows: 

A subject is brought, or comes under his 
own power, to our clinical laboratory. It is 
determined, on grounds not relevant here, 
that the purpose of laboratory procedures is 
to obtain a decrement in the number of hours 
the subject spends alone. Whereas the sub- 
ject now spends, on the average, a total of 
ten out of seventeen waking hours alone, it is 
desired that he spend, on the average, less 
than five hours a day alone. Number of hours 
spent alone becomes our dependent variable. 
From a study of the subject’s past history 
and whatever we can learn from psychological 
tests, we formulate hypotheses concerning the 
intervening variables related to the depend- 
ent variable. Since we cannot observe or 
change hours spent alone outside the labora- 
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tory, we need to change behavior in the labo- 
ratory that is highly correlated with our de- 
pendent variable. Thus, in this particular 
case, we might set up as intermediate de- 
pendent variables such things as number of 
words uttered by the subject, the amount of 
time spent by him facing the experimenter, 
the changes in pitch of his voice (affect), 
random movements, posture, facial expres- 
sion, etc. We now set up a controlled stimulus 
situation in the clinical laboratory, which in 
interaction with the intervening variables, is 
calculated to affect the dependent variable. 

The behavior in question, number of hours 
spent alone, can vary in three directions. It 
can go up, down, or remain stationary. Our 
prediction is that it will go down. From the 
point of view of our prediction, it can only 
vary in two directions; it can go down or not 
go down. Disregarding all other matters ex- 
cept those under immediate discussion, the 
probability of effecting behavioral change in 
the required direction is one over two for the 
single case. 

We now institute our laboratory procedures 
and obtain behavioral change in the predicted 
direction. We get another case with a differ- 
ent, gross, dependent variable, inability to 
care for her infant in a young mother, and 
again obtain behavioral change in the pre- 
dicted direction. With other cases and other 
dependent variables we, again and again, ob- 
tain gross behavioral change in the laboratory 
that we “guess” is highly correlated with be- 
havior outside the laboratory. Since these are 
independent events, we can multiply prob- 
abilities. What level of significance will 
satisfy us, .01, .001, or .0000001? 

With this orientation, the writers entered 
the files of the Psychological Service Center 
of the University of Tennessee. This Service 
Center is operated by the Department of 
Psychology for training and research pur- 
poses in connection with the Clinical Psy- 
chology Training Program. It takes referrals 
of children and adults from agencies, physi- 
cians, the school systems and hospitals, to 
mention some of the referral sources. The 
Service Center takes all cases for diagnosis 
or treatment which are felt to be within the 
province of a clinical psychologist working in 
an outpatient setting. 
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The Service Center has been in operation 
in its present form since 1951. During that 
time there have accumulated 1,022 case files. 
The files were entered with the aid of a 
table of random numbers. Prior to entering 
the files, however, certain rules for selection 
were set up. All cases were used which met 
the following criteria: (@) any case which 
was terminated by consent of both therapist 
and patient, (5) any case of long-term, 
“deep” therapy which was in progress or was 
interrupted, where the patient was seen in at 
least one hundred interviews (called Type III 
therapy), (c) any case of supportive brief 
therapy which was in progress or was inter- 
rupted, where the patient was seen in at least 
thirty sessions (called Type II therapy), (d) 
any case where environmental manipulation 
was the major goal, where the patient was 
seen in as many as five interviews (called 
Type I therapy). These criteria were set up 
on the basis of considerable previous experi- 
ence. It was felt that cases which had less 
than the criterion number of interviews for 
the type of therapy indicated did not consti- 
tute an experiment. The criteria concerning 
what did or did not constitute an experiment 
in this context are outside the range of this 
discussion but are analogous, in the reason- 
ing behind them, to the criteria applied in 
other more systematic experimental settings. 
In any case, these rules were set up before 
entering the files. Cases were excluded in 
which large environmental effects beyond any 
possibility of control by the therapist inter- 
fered with therapy. In this manner ten cases 
were selected from the files. 

For each case, notation was made, from 
the case history, of behavior which was con- 
sidered deviant and it was desired to change, 
as reported either by the patient or someone 
else who gave the history, the latter very 
often being the case with child patients. 
Thereafter, after conclusion or interruption of 
therapy, a final summary of the case, prog- 
ress reports, or statements of the patient were 
used to make a judgment as to whether or 
not the behaviors previously noted changed 
in the predicted direction. No follow-up was 
used, no attempt was made to find out 
whether or not the patient maintained the be- 
havior change over a period of time. We hold 
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no brief for the objectivity of this procedure; 
we are cognizant of the possible biases of 
whoever wrote the case records, of the in- 
tangibles underlying the patient’s “good” re- 
ports, etc. The reliability of judgments as to 
whether or not behavior changed in the pre- 
dicted direction, is perfect. This finding is 
understandable because, disregarding the re- 
liability of the case histories themselves, the 
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questions asked of the case records could be 
answered by a simple yes or no. 

Table 1 shows the results obtained, ten hits 
and no misses. Skepticism seems appropriate 
for these data. Therefore we pulled out an- 
other ten cases meeting the criteria for se- 
lection. Table 2 shows the results, eight hits 
and two misses. Although expansion of the 
binomial indicated a satisfactory level of sig- 


Table 1 


First Group of Ten Meeting the Criteria for Selection 














No. of 
File inter- Predicted 
No. Age Sex views Behavior to be changed Change direction 
322 30 F 194 1. Fainting spells 1. None for 2 years Yes 
2. Overweight 2. Lost 14 pounds 
355 7 M 50 1. Fearful, withdrawing behavior 1. Active participation in school Yes 
in school activities, improved grades 
155 27 F 70 1. No social contacts with men 1. Began dating regularly Yes 
2. No contacts with parents 2. Began visiting parents 
402 28 F 113 1. Unable to care for children and 1. Caring for children and doing Yes 
perform household duties housework consistently 
2. Few social contacts 2. Began regular church attend- 
ance 
302 8 M 34 1. Submissive and passive behav- 1. Began having fights in school Yes 
ior, e.g., beaten by younger and took responsibility for 
children younger sibs 
2. Frequent accidents 2. None while in treatment or im- 
mediately after 
727 51 F 13 1. Uncontrollable weeping 1. Ceased completely Yes 
2. Insomnia 2. Sleeping well 
143 35 M 102 1. Multiple phobias 1. These subsided almost com- Yes 
pletely 
2. Frequent hostile outbursts 2. Ceased completely 
3. No dating 3. Frequent dating 
69 27 M 135 1. Drinking excessively to relieve 1. Not drunk for 10 months Yes 
anxiety 
2. Somatic symptoms, e.g., stom- 2. None for one year 
ach cramps 
3. Frequent prostitutes 3. None for one year 
108 26 M 214 1. Infrequent dating 1. More dating Yes 
2. Ashamed of poor hearing and 2. Bought hearing aid and used it 
refused to wear a hearing aid 
3. Few social contacts 3. Began to join clubs, etc. 
54 28 M 41 1. Frequent quarrels 1. These greatly diminished Yes 
2. No plans for future and dis- 2. Entered graduate school with 


satisfied with job 


plans 
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Table 2 


Second Group of Ten Meeting the Criteria for Selection 














No. of 
File inter- Predicted 
No. Age Sex views Behavior to be changed Change direction 
380 8647 M 10 1. Acute alcoholic episode where 1. Sobered up Yes 
he was continually drunk 
2. Unable to work 2. Working steadily for 2 years 
369 389s 42 M 18 1. Using narcotics 1. None for 14 years Yes 
2. Crying spells 2. None for 2 years 
3. Hypochondriacal complaints 3. None for 2 years 
884 5 M 60 1. Unintelligible speech 1. Now easily understood Yes 
2. Poor coordination (muscular) 2. Better coordination 
3. Not completely toilet trained 3. Toilet training completed 
4. Played with electrical outlets 4. No shocks for 10 months 
and received severe shocks 
384 19 M 162 1. Several attempts at suicide 1. None for 24 years Yes 
2. Crying, tremulousness 2. None for 24 years 
3. Poor grades at college 3. Better grades 
184 32 M 315 1. Took too much insulin for dia- 1. This corrected Yes 
betes against doctor’s orders 
2. Hypoglycemic attacks 2. None for 3 months 
3. Frequent awakening at night 3. Ceased 
4. Very frugal 4. Now buys furniture, etc. and 
takes trips 
5. Very passive in dealing with 5. Has been able to fire subordi- 
subordinates nates when necessary 
974 32 M 25 1. Had failed doctoral language 1. Passed language exam and pre- Yes 
exam twice lims later 
2. Verbally threatened violence 2. Stopped 
toward a professor 
865 21 F 48 1. Few close friends 1. No change No 
430 29 M 40 1. Overt homosexual acts 1. No homosexual acts for 8 Yes 
months 
2. Frequented prostitutes 2. None for one year 
3. Constant extreme muscular ten- 3. Moderate tension occasionally 
sion 
292 38 M 252 1. Inhaled nitrous oxide and in- 1. Gradual decrease until total Yes 
duced hallucinations stop 6 months ago 
2. Sat up nights listening to Bible 2. Ceased 
on tape recorder 
171 22 M 31 1. Excessive drinking 1. This reduced No 
2. High blood pressure, somatic 2. Became more severe and hos- 
weakness, burning sensation in pitalization became necessary 
stomach 
3. Hyperactive 3. No change 





nificance even for this finding (.05) we were 
nevertheless worried by the trend indicated 
in the two misses obtained. We selected an- 
other ten folders from the files. Table 3 shows 


the results, ten hits and no misses. The find- 
ing satisfied us. We felt we could safely bet 
on the side of the “hits.” 

Tables 4 and 5 show all of the cases pulled 
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from the files, 350. It will be noted that the 
bulk of these were diagnostic only. The cate- 
gory “Inquiry Only” exists because any first 
interview conducted at this Center is usually 


written into the files despite the fact that the 
inquiring person may never return to the 
Center. The second largest category listed in 
the table is that labeled “Too Few Sessions.” 


Table 3 
Third Group of Ten Meeting the Criteria for Selection 














No. of 
File inter- Predicted 
No. Age Sex views Behavior to be changed Change direction 
405 8 M 47 1. Failing grades at school 1. Began receiving passing grades Yes 
2. Frequent disciplinary problem 2. Marked reduction in misbehav- 
ior 
895 24 F 5 1. Indecision over impending en- 1. Became engaged Yes 
gagement 
2. Dreams and nightmares 2. These ceased 
3. Had broken engagements in 3. Got married 
past and avoided becoming se- 
rious with boys 
14 29 F 197 1. Suicide attempt 1. No further attempts Yes 
2. Constant sitting on husband’s 2. Ceased 
lap 
3. Weeping behavior 3. Ceased 
4. Few social contacts 4. Began attending social affairs 
often 
385 4 M 163 1. No speech 1. No change Yes 
2. No response to others 2. Attempts to communicate by 
pointing 
3. Assumed positions, posturing 3. This done infrequently 
4, Habitually constipated 4. No problem for 6 months 
616 25 F 44 1. Few social contacts 1. Joined clubs, etc. Yes 
2. No pleasure in sex relationships 2. Began to achieve orgasm 
3. Fear of driving a car 3. Learned to drive 
67 32 M 93 1. Wife beat him 1. This ceased and at times he Yes 
struck her 
2. Spells of extreme physical weak- 2. None for about 3 years 
ness 
3. Crying spells 3. None for about 3 years 
110 8622 M 25 1. Attacks of nausea and depres- 1. Ceased for about 34 years Yes 
sion 
2. Left school and was indecisive 2. Returned to medical school 
about future 
226 20 M 65 1. Failing courses at school 1. Improved grades and graduated Yes 
2. Not eating and sleeping well 2. Eating and sleeping well 
170 32 M 155 1. Worked over-long hours 1. Ceased Yes 
2. Unable to communicate to wife 2. Easier verbalizations with 
or subordinates at work people 
325 22 F 77 . Excessive sleeping . This ceased Yes 
. Exhaustion and nausea often . This ceased 


- wd 


. Hypnogogic experiences 
. No dating 


- whe 


. These remained 
. Got married 
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This category is probably the one that would 
be most suspected in estimating the effects of 
psychotherapy in our presentation. It will be 
noted that there are more cases in this cate- 
gory than the thirty that were accepted for 
the study. Whether or not these can be con- 
sidered failures in psychotherapy is a matter 
for discussion beyond the realm of this paper. 
We feel that the matter is irrelevant to our 
main purpose. The category “Environment 
Effect” is broken down and self-explanatory. 
Of the thirty cases retained for the analysis, 
the table shows that there were five children 
and twenty-five adults. Most of the adults 
were in what we call Type II therapy which 
consists of attempts to modify variables 
within and without the person. Type I ther- 
apy is simply environmental manipulation. In 
Type III therapy an attempt is made to con- 
centrate on variables within the individual to 
effect behavioral change with relatively little 
attention being paid to the environment. 
Obviously, the tables do not “prove” any- 
thing. It is hoped, however, that they might 
contribute to belief and, more importantly, 
illustrate the possibility of a systematic ap- 
proach to the problem of success or failure in 
psychotherapy. It should be pointed out that 
we have not been concerned with whether the 
patient was “improved” or “unimproved” as 
has been done in previous studies (1, 2). It 
is impossible to see how any judgment can 
be made, let alone refined statistical pro- 
cedures applied with unreliable data such as 
“improved” or “unimproved.” Nor have we 
been concerned with whether or not the 
mechanisms of ego-defense became more suit- 


Table 4 


Analysis of Cases Rejected 











Reason for rejection Number 
Diagnostic only 231 
Inquiring only 34 
Environmental effect 12 
Couldn’t pay 5 
Hospitalized 3 
Left town 3 
Withdrawn by physician 1 

Too few interviews to meet criterion 43 
Total cases rejected 320 
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Table 5 
Analysis of Cases Retained 














Behavior changed 

Mean in predicted 

No. of direction 
Type of Hours 
therapy Seen Number Yes No 
Type I 10 1 1 0 
Type II 60 16 14 2 
Type ITI 184 8 8 0 
Play 71 5 5 0 

Total cases retained 30 





able (3) or if the patient was more comfort- 
able or optimistic (4) or a host of other simi- 
lar measures proposed to estimate the effects 
of psychotherapy. 

By all means let us make the patient’s ego- 
defenses more suitable. Let us change his 
attitudes about himself and others. But these 
have no significance if they are not reflected 
in the behavior of the patient as it is dis- 
played in the environment in which he has to 
live. Making ego-defenses more suitable (or 
what have you) may be an hypothesis con- 
cerning a way to effect behavioral change but 
it cannot be used as a criterion. Neither can 
unvalidated tests and questionnaires (no mat- 
ter how much they are Q-sorted) be used as 
criteria. If the ultimate purpose of psycho- 
therapy is to effect behavioral change (and 
it has to be) then behavior must be the cri- 
terion. This proposition not only makes sense 
but it gets us closer to the science of psy- 
chology. 

The writers are aware that something other 
than mere behavioral change is considered in 
the problem of success or failure in psycho- 
therapy. There is, for instance, the considera- 
tion of time during which behavioral change 
is maintained after psychotherapy. The an- 
swer to this question pushes us back to our 
systematic formulation. If behavior is changed 
by simple environmental manipulation (Type 
I) then the maintenance of changed behavior 
will depend on the stability of the changed 
environment. If, on the other hand, the proc- 
ess of changed behavior was accomplished ex- 
clusively by manipulation of variables within 
the individual (Type III), then the changed 
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behavior should be maintained with relative 
freedom from the environment. No thorough- 
going study with results of psychotherapy 
can neglect a consideration of procedure used 
to effect behavioral change if the time di- 
mension is to be added to the notion of suc- 
cess or failure. Another consideration is that, 
although success or failure may be concerned 
with the achievement, in the environment, of 
a desired and predicted change in behavior, 
the question as to what behaviors, and how 
much they have to be changed, is unknown 
at this writing. This consideration leads us 
into the morass known as “psychological 
health.” We shall by-pass this matter and 
end our polemic by asserting that we believe 
a person’s behavior can be changed in a pre- 
dicted direction by the systematic use of well- 
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established principles known to the science 
of psychology. 


Received May 28, 1956. 
Early Publication. 
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Relationship of Success and Length in Psychotherapy 


James W. Taylor 


Veterans Administration Hospital, Denver, Colorado 


Cartwright (1) has suggested the existence 
of a “failure zone” between the 13th and 21st 
interview in client-centered therapy. The pres- 
ent report is made in order to indicate a 
similar result occurring in a psychoanalyti- 
cally oriented clinic. In the process of other 
research, the total number of cases closed in 
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leaves much to be desired, such judgments 
are common in most clinics operating under 
practical restrictions. Figure 1 combines the 
data presented by Cartwright in three graphs 
with the corresponding data of this clinic. 
Representation of the two scales of improve- 
ment on the same figure was not meant to 
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Fig. 1. Rated success as a function of number of interviews. The break in the horizontal axis at 30 in- 
terviews indicates that both the class intervals of 29 and 33.5 include the 30th interview (4 cases) which 
was necessary in combining the graphs. N indicates the number of patients in each class interval. 


the clinic during a fiscal year (309 cases of 
varying diagnosis and with different thera- 
pists) were evaluated for number of treat- 
ment interviews and for results of treatment. 
Results of treatment are evaluated as, 0 un- 
improved, 1 mildly, 2 moderately, and 3 
greatly improved by the therapist and con- 
sultant. While this criterion for improvement 


imply their equality; rather, the purpose is 
to illustrate the similarities of the curves. 


Received January 13, 1956. 


Reference 


1. Cartwright, D. S. Success in psychotherapy as a 
function of certain actuarial variables. J. con- 
sult. Psychol., 1955, 19, 357-363. 


332 











Journal of Consulting Psychology 
Vol. 20, No. 5, 1956 


The Representation of Orthopedic Disability in 
Children’s Figure Drawings 


A. B. Silverstein and H. A. Robinson 


Psychiatric Institute, University of Maryland School of Medicine 


Twenty years have passed since Schilder 
first expressed his belief that “the way in 
which children draw human figures really re- 
flects their knowledge and sensory experience 
of the body image” (14, p. 106). Since that 
time many investigators have developed this 
observation both at the level of theory and 
in clinical practice. Body image has been 
variously described by different writers, and 
the lack of a universally accepted definition 
has undoubtedly served as a deterrent to sys- 
tematic research in this problem area. None- 
theless there has remained a widely held be- 
lief that drawn human figures do reveal some- 
thing of the subject’s conceptual picture of 
his own body. Machover, the most active 
proponent of dynamic interpretations of fig- 
ure drawings, has stressed that “the process 
of drawing the human figure is for the sub- 
ject . . . a problem not only in graphic skill, 
but one of projecting himself in all of the 
body meanings and attitudes that have come 
to be represented in his body image” (10, p. 
35). Today, figure drawings are a commonly 
accepted tool for investigating personality 
dynamics, self concept, and body image; a 
familiar part of the clinical battery. This 
technique, however, shares with many other 
projective instruments the need for convinc- 
ing and unequivocal demonstrations of va- 
lidity. 

The present authors are engaged in an in- 
vestigation of psychological factors in the 
treatment of and recovery from poliomyelitis,’ 
and have had the opportunity to study fig- 
ure drawings of children in various stages of 


1 Aided by a grant from the National Foundation 
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that illness. In exploratory phases of this 
work we were frequently impressed with what 
appeared to be a rather direct reflection of 
the child’s disability in his drawings, and, 
given impetus by these observations, decided 
to initiate a more systematic study of the 
technique. If this procedure could be demon- 
strated to yield a true picture of body image, 
it would prove extremely useful, since data 
from other sources than figure drawings have 
shown that the discrepancy between body im- 
age and the actual physical body is related 
to adjustment processes in physical disability 
(2). It should also be noted that figure draw- 
ings as such have received scant attention as 
possible measures of attitudes toward disabil- 
ity. A nonverbal measure of disability-associ- 
ated attitudes and modes of adjustment would 
be of great value to workers who deal with 
the disabled. 

The earliest study of the figure drawings of 
handicapped subjects is reported by Machover 
(10). Her brief account of this investigation 
of 20 orthopedic cases provides no methodo- 
logical details. It would appear that drawings 
were simply scrutinized for the possible pro- 
jection of disturbances of body image, and 
the preliminary results are said to have been 
striking. Other investigators (1, 6, 12, 16) 
have taken a similar approach to drawings 
produced by various types of disabled sub- 
jects, and their results, while not as impres- 
sive as Machover’s, seem to be encouraging. 
Taken together, these studies suggest that, 
while relatively few subjects portray their 
disabilities directly, indirect evidence in the 
form of distortion and displacement occurs 
frequently. 
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1. Study of Subjective Classifications 


Method. The writers have used a sorting 
approach in studying a series of drawings by 
disabled children. Same-sex, opposite-sex, and 
self-figure drawings were obtained from a 
group of 22 chronic poliomyelitis cases with 
paralytic residuals of the lower extremities. 
Each of the three sets of drawings was evalu- 
ated independently by the authors. With no 
preliminary formulation of a classification 
scheme, and with negligible disagreement be- 
tween the judges, we found that these draw- 
ings could readily be sorted into six cate- 
gories. The distinguishing features of these 
groupings were: (@) omission of a portion of 
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the legs; (6) external support for the drawn 
figure, e.g., by an orthopedic appliance; (c) 
marked size discrepancy of the legs; (d) im- 
proper attachment of the legs to the trunk; 
and (¢) unusual treatment of the upper ex- 
tremities, with the legs judged to be within 
normal limits. The final category (f) in- 
cluded drawings considered to embody neither 
direct nor indirect evidence of disability. 
Results. Four of the 22 self-figures are pre- 
sented in Fig. 1. Though the drawings differ 
a great deal, each appears to show clearly 
and dramatically some representation of im- 
pairment. In one way or another these chil- 
dren seem to have projected their physical 
condition upon their drawn figures. Nor are 
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Fig. 1. Self-figures drawn by four children with lower extremity disability following poliomyelitis. 

A. Self-figure by a ten-year-old boy in Grade 4. He wears bilateral full-leg braces and has a marked 
residual weakness of both legs as a result of a spinal poliomyelitis at age two. 

B. Drawing by a boy, a nine-year-old second-grader. Disease onset was at age five. 
restricted to the right leg: he wears a short leg brace. 

C. Drawing by a nine-year-old girl in the second grade. Since the poliomyelitis onset before her first 
birthday, she has been readmitted to an orthopedic hospital on five occasions for observation and cor- 
rective surgery. Severe paralytic residuals of both legs. She wears a long leg brace with corrective 
shoes. Note that the legs of her figure are “paper-chopped.” 

D. Figure by a ten-year-old girl in Grade 3. The disease onset occurred at age 5. Her right leg is es- 
sentially flail below the knee, and she wears a long leg brace with pelvic band. This child has had three 
hospital readmissions for corrective surgery. Note that she has faithfully pictured her brace, though in 
mirror-fashion. 


Residual is now 
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Table 1 


Classification of Figure’ Drawings by 22 Children 
With Lower Extremity Disability 
Following Poliomyelitis 











Same- Opposite- Self- 
Group sex sex figure 
Portion of lower extremities 
omitted 5 3 3 
External support provided 0 4 
Size discrepancy in lower ex- 
tremities 6 7 
Improper attachment of legs 3 4 2 
Unusual treatment of upper 
extremities 4 3 4 
No evidence of disability 4 5 4 
N 22 22 22 





these atypical examples; many of the draw- 
ings, by direct representation, displacement, 
distortion, or other unusual treatment, are 
equally suggestive of body-image disturbance. 

Table 1 presents the distribution of figure 
drawings among the six categories which have 
been described. In each of the three sets of 
drawings, more than three-quarters of the 
subjects were judged to have portrayed a 
physical disability, either directly or indi- 
rectly. By inspection the three sets of draw- 
ings show similar distributions among the 
categories, with the one exception of the “ex- 
ternal support” classification. In this instance 
three self-figures were depicted with ortho- 
pedic appliances, and one seated at a desk. 
This might suggest that the most direct rep- 
resentation of disability may be found in self- 
figures. 

An additional observation should be made 
with respect to a pattern of consistency found 
in these drawings. If an individual omits a 
part of the legs of one of his figures, for ex- 
ample, he tends to treat the other figures 
similarly. Of the 22 subjects, 16 have either 
two or three of their drawings in the same 
category, whereas by chance this should oc- 
cur only 9 or 10 times. This difference from 
chance expectancy is significant at the .01 
level. 

Discussion. While these findings, like those 
of the investigations cited above, appear very 
encouraging, it is clear that this subjective 
approach to the study of figure drawings is 
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seriously deficient from the point of view of 
scientific rigor. For one thing, the judges who 
evaluate the drawings are not only aware that 
they are dealing with productions of handi- 
capped individuals, but also have some gen- 
eral knowledge of the nature and site of the 
disability. This may introduce bias in at least 
two ways: the observer may tend to over- 
weigh minor deviations as evidence of dis- 
ability, or he may de-emphasize certain char- 
acteristics not obviously related to the known 
impairment. Another difficulty in this method 
of dealing with figure drawings stems from 
the enormous variety of the productions. 
Even when working with small samples, one 
must arbitrarily limit the number of classifi- 
cations to avoid the unmanageable prolifera- 
tion of categories. And this inevitably poses 
the need for an equally arbitrary system of 
priorities, since many drawings will embody 
the characteristics specified in more than one 
category. Drawing A in Fig. 1 illustrates this 
problem: the figure lacks feet, and the legs 
are attached to the body in an unusual way. 
An even stronger objection to this approach 
is that the frequency of occurrence of the 
various drawing features in a comparable 
physically normal sample is unknown. To the 
extent that the drawings of clinically normal 
individuals are found to exhibit the same 
characteristics as those of disabled persons, 
the utility of the technique is diminished. 
Taken together, criticisms of this sort seri- 
ously undermine any conclusions which may 
be drawn from studies employing subjective 
classification and appraisal of human figure 
drawings. To be sure, normal controls and 
objective methods have not been entirely neg- 
lected in studying figure drawings of the dis- 
abled. Several investigators (3, 12, 15) have 
compared the drawings of handicapped and 
normal individuals, using checklists of signs 
or physical measurements of the drawings. 
Without exception these investigations yielded 
negative results, in the sense that drawings 
of disabled and normal subjects were not re- 
liably distinguished. In each case, however, 
it would appear that flaws in sampling or 
methodology may have had the effect of ob- 
scuring differences in the drawings. In this 
connection the following points deserve men- 
tion: Failure to control for correspondence 
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Table 2 
Comparison of Disabled and Control Subjects with Respect to Age, Intelligence, and Grade 
Age IQ Grade 
Group Mean SD Mean SD Mean SD 
Disabled (VN = 22) 11.2 1.19 94.6 9.27 4.7 1.48 
Control (WN = 44) 11.1 1.02 97.6 7.26 5.2 0.94 
t 0.48 1.40 1.52 
F 1.39 1.67 2.54 
p >.10 >.10 >.10 >.10 >.10 <.05 





between the sex of the subject and that of his 
drawn figure denies, in effect, the possibility 
of differential identification with, and projec- 
tion on, same-sex and opposite-sex figures. In- 
clusion of items in scoring schemes which are 
not directly related to the known impairment, 
or which cannot in reason be justified as pos- 
sible displacements or symbolizations of the 
disability, may needlessly elaborate the meas- 
uring instrument without increasing its sensi- 
tivity. An example of a seemingly irrelevant 
item, reported in the literature, is one deal- 
ing with the Adam’s apple in a study of the 
drawings of disabled children. Use of ex- 
tremely heterogeneous groups—a child with 
a congenital deformity of an arm in the same 
sample with a child having an osteomyelitis 
of the leg, for example—seemingly disregards 
the psychological significance of the nature 
and site of the disability. 

It should fairly be stated that the hetero- 
geneity of disabled individuals poses a diffi- 
cult and fundamental problem for any re- 
search effort in this area. Besides the sources 
of variation found in physically normal indi- 
viduals, such considerations as site, degree, 
and duration of disability make it virtually 
impossible to secure properly homogeneous 
samples of adequate size. Thus the research 
worker must knowingly and selectively sacri- 
fice such variables as seem least pertinent to 
his goal, and at the same time remain sensi- 
tive to the need for homogeneity. The 22 sub- 
jects of the present study can be considered 
homogeneous to this extent: They share the 
same disease entity and the same general site 
of disability. They are enrolled in a public 
school equipped to deal with orthopedically 
disabled children who by reason of their 
handicap cannot attend their regular classes. 


This is an environmental control of a sort, 
and also ensures that each subject has a 
marked degree of disability. Age of onset 
varies greatly in this sample, but each child 
was at least a year and a half past the acute 
stage of the disease when he was seen. 


2. Controlled Study of Signs 


Method. Having considered these subjects 
in the light of factors related to their dis- 
ability, it is now necessary to report some 
other characteristics of the group, and to de- 
scribe a sample of normal subjects who were 
introduced as a control. The disabled children 
ranged in age from 9 to 12 and were equally 
divided by sex. Over 100 physically normal 
children from the same metropolitan school 
system were tested, and 44 of these sub- 
jects were selected to form a group com- 
parable to the disabled sample with respect to 
age, grade, and reported intelligence quotient 
(Kuhlmann-Anderson). These 44 nondisabled 
children, equally divided by sex. also ranged 
in age from 9 to 12. The control group was 
established at twice the size of the experi- 
mental sample to increase the reliability of 
comparisons, and because physically normal 
subjects were easily available. The group of 
children with poliomyelitis residuals, on the 
other hand, represents nearly all available 
cases. Table 2 presents a comparison of the 
two groups. 

The equation of the two groups is seen to 
be entirely acceptable for age and _ intelli- 
gence and mean grade placement; no signifi- 
cant differences are obtained. The signifi- 
cantly greater variance in grade placement 
observed in the disabled sample may well 
reflect differing periods of absence from school 
during treatment for poliomyelitis. With re- 
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spect to the measures reported, it is sub- 
mitted that the two groups are essentially 
comparable. 

Each sample was tested in a classroom 
situation during the school day. The conven- 
tional figure-drawing procedure was followed: 
the subjects were instructed to draw a per- 
son, and a person of the opposite sex from 
the first. They were then asked to draw pic- 
tures of themselves. No time limits were set, 
and no further directions were supplied. 

To analyze the drawings obtained, a list 
of 55 scoring items or signs was developed. 
Thirty-seven of the signs deal specifically 
with types of treatment which may be given 
to lower extremities in figure drawings; for 
example, shape of foot, length of leg, and 
omissions. Six signs call for comparisons be- 
tween the arms and legs of the figures, as in 
noting differences in the use of shading. The 
remaining twelve signs involve no considera- 
tion of the legs of the figure, but pertain to 
other features which might be expected to re- 
flect body-image changes resulting from dis- 
ability, such as the size of the figure and its 
placement on the paper. Most of these scor- 
ing signs were taken directly or adapted from 
the reports of Buck (5), Goodenough (8), 
and Machover (10), and the remainder de- 
vised by the present writers. 

For each sign, specifications were drawn 
up to increase objectivity of scoring, and a 
preliminary test of the procedure was satis- 
factorily completed. The drawings of the ex- 
perimental and control groups, combined in 
a random order, were then scored independ- 
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ently by the authors. Each sign was scored 
plus, minus, or unscorable. As an example, a 
figure with misshapen feet would receive a 
plus score for a particular item; a figure with 
feet judged to be normal would be scored 
minus; and a figure from which the feet had 
been omitted would be considered unscorable 
for this item. Of the 10,890 individual signs 
(55 signs for each of three drawings by 66 
subjects), 88 per cent were scored in agree- 
ment by the two judges. Scoring disagree- 
ments were later resolved in conference. For 
individual signs, the percentage of initial 
agreements ranged from 54 to 98 per cent, 
but for only three items was it less than 80 
per cent. An agreement of 33 per cent would 
be expected by chance. On the whole, the ob- 
tained scoring reliability is considered to be 
acceptable. 

Results. In statistical analysis, the three 
sets of drawings (same-sex, opposite-sex, and 
self-figures) were treated separately. For each 
set, chi square corrected for continuity was 
employed to compare the drawings of the two 
subject groups on the 55 signs. Fisher’s 
method (7, p. 96) was used to determine the 
exact probability associated with chi squares 
which approached significance. For each com- 
parison, the null hypothesis was tested against 
the alternative that the disabled group con- 
tained a greater proportion of subjects scor- 
ing plus on a given sign. Table 3 presents a 
summary of the statistically significant dif- 
ferences found among the 165 tested. 

In the 55 comparisons made for each of 
the three sets of drawings, two or three dif- 


Table 3 


p Values Exceeding the Five Per Cent Level in Comparison of Figure Drawings of 
Disabled and Control Subjects 


(Total number of comparisons for each figure is fifty-five) 











Item Same-sex Opposite-sex Self-figure 
3. Figure placed at side edge of paper 039 
16. Horizontal midline (waist area) emphasized 017 
20. Feet differ in size or shape (front view) 035 
22. Leg drawn lighter, thinner, or fainter than corresponding arm 049 
23. Figure placed above horizontal midline of paper 014 
26. Length of leg greater than twice length of trunk .037 
28. One leg thinner than the other 029 
51. Width of leg increases toward foot 039 
55. Leg attached to but not continuous with body 0002 009 
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ferences significant at the .05 level or better 
might be expected on the basis of chance 
alone. It would be necessary to obtain at 
least six such differences to rule out the op- 
eration of chance factors (at the .05 level) 
(13). The number of differences actually 
found to be significant at the .05 level or 
better, for the same-sex, opposite-sex, and 
self-figures, was two, five, and three respec- 
tively.* It is accordingly clear that the draw- 
ings of the disabled and normal subjects have 
not been reliably distinguished by the scoring 
system employed. 

Discussion. The results of this objective 
and analytic approach are plainly less en- 
couraging than those obtained when the 
drawings, as described earlier, were sorted 
into a limited number of categories. It hap- 
pens that two drawing features are given 
similar consideration in these widely differ- 
ent methods of appraisal, and it is of some 
passing interest to scrutinize them. In the 
initial sorting procedure, malattachment of 
legs to body was thought to be a rather 
direct representation of impairment. Among 
the 55 signs there appears a virtually identi- 
cal item (No. 55). The sign analysis does in 
fact show that—for two of the three figures— 
the disabled group contains a significantly 
greater proportion of subjects scoring plus on 
this item. In the other instance, the omission 
of a part of the leg placed a drawing in one 
of the original categories, and again this item 
is duplicated in the list of signs. Here the 
sign approach does not appear to bear out 
the original impression, namely, that omitting 
a lower extremity part is an evidence of dis- 
ability. In their drawings the handicapped 
children do not more frequently omit parts 
of legs than do the controls. On the con- 
trary, a significantly greater proportion of 
the normal subjects makes such omissions. 
(The difference, for which p is less than .05, 
lies in the unpredicted direction, and is there- 
fore not reported in Table 3.) This, clearly, 
is an example of a finding of a subjective and 


2 These figures refer only to differences significant 
in the predicted direction. Three differences which 
were statistically significant in the unpredicted di- 
rection—where the control group exceeded the dis- 
abled sample in proportion of plus scores—are not 
reported in Table 3. 
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impressionistic approach which does not stand 
up under more careful scrutiny. 

From the methodological point of view, any 
subjective classification method is clearly in- 
ferior to the objective and analytic sign ap- 
proach. It should, however, be noted that 
this latter method is itself not beyond criti- 
cism. As mentioned earlier, the bulk of signs 
used in this study dealt specifically with types 
of special treatment given to the legs of the 
drawn figures, with the rationale that the pri- 
mary disease residual was in the lower ex- 
tremities of the subjects. Yet it may be that 
restricting the choice of signs in this way has 
unduly de-emphasized other distinctive treat- 
ments, as when the body-image representa- 
tion of the physical disability generalizes, or 
is displaced to other portions of the body 
than the legs. At a somewhat different level, 
the criticism has been offered that “‘the check- 
list [of signs] violates that holistic, dynami- 
cally oriented approach which competent cli- 
nicians use in making drawing analyses” (4, 
p. 179). Criticisms of this kind seem entirely 
reasonable, and must be taken seriously. It is 
possible, with the materials of the present 
study, to utilize a more global, “clinical” ap- 
proach and at the same time retain a meas- 
ure of scientific rigor. 

Clinical judgments as to whether figure 
drawings were executed by disabled or nor- 
mal subjects have been reported in the litera- 
ture (3, 15). The findings of these studies 
have been negative, i.e., the drawings of nor- 
mal and disabled groups were not reliably 
distinguished. However, some of the same 
flaws (heterogeneous samples and the fail- 
ure to control for sex) found in studies in- 
volving the sign approach may have also 
served to obscure differences between groups 
in investigations using clinical judgments. 
The relative homogeneity of subjects in the 
present study, and control of the sex factor, 
here provide a better opportunity for such 
differences to appear. 


3. Controlled Study of Global Judgments 


Method. Each of three judges was given the 
three randomized sets of same-sex, opposite- 
sex, and self-figure drawings, with instructions 
to select—by whatever cues the drawings 
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might provide—the 22 figures in each set 
which had been produced by the physi- 
cally disabled children. The three judges 
were postdoctoral clinical psychologists ex- 
perienced in the psychodiagnostic use of fig- 
ure drawings. Before evaluating the draw- 
ings, they were given information about the 
general characteristics of the two subject 
groups (age, sex, and IQ), and about the 
special characteristics of the chronic polio- 
myelitis children (nature, site, and duration 
of disability). For an individual figure, the 
judges knew only the sex of the child who 
had made the drawing; their task, in effect, 
was to choose—from 66 drawings—11 draw- 
ings by boys and 11 by girls which they con- 
sidered to be productions of disabled subjects. 

Results and discussion. The choices of each 
of the three judges were first compared with 
those of the others, for the three sets of draw- 
ings. On the basis of chance, 56 per cent of 
the judgments of any two clinicians would be 
expected to agree. (Two judges, who agree 
by chance on one-third of their choices of 
disabled children’s drawings, will also have 
implicit agreement on two-thirds of their 
evaluations of figures by normal subjects.) 
The actual percentage of agreements between 
judges ranged from 52 to 79 per cent for the 
nine possible combinations (three sets of 
drawings seen by three judges). Four of the 
comparisons yielded better than chance agree- 
ment (.05 level), suggesting that to some ex- 
tent the judges were using similar cues. 

In considering the ability of the judges to 
distinguish between the drawings of disabled 
and normal children, 56 per cent of the se- 
lections might again be expected to be cor- 
rect by chance. Correct judgments actually 
ranged from 52 to 64 per cent for the nine 
sets of evaluations. No one set of drawings 
was more accurately appraised than any 
other, and no one judge was superior to the 
others in his ability to distinguish among the 
drawings. Nor were any judge’s ratings sig- 
nificantly better than chance. 

These experimental findings, like those ob- 
tained with the sign approach, do little to 
enhance the value of the figure-drawing tech- 
nique for the study of body image in physi- 
cal disability. The promise of the original 
impressionistic sorting method has been ful- 
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filled neither by an objective, analytic pro- 
cedure, nor by a controlled holistic approach. 


Discussion and Implications 


A number of hypotheses may be suggested 
to account for the negative results which have 
been reported, though none of these hypothe- 
ses can be tested with the present data. Har- 
ris and Goodenough, on the basis of a large- 
scale investigation of children’s drawings, 
have concluded that it is somewhat doubtful 
whether children actually do project the body 
image upon drawn figures. They remark that 
“so much of the drawing technique in chil- 
dren under 12 or 13 years of age relates to 
concept formation and understanding, that 
probably only a small part of drawing be- 
havior relates to the child’s interest, tempera- 
mental qualities, and other personality fea- 
tures.” * Since few of the subjects in the 
present study were older than twelve, it may 
be that only a small part of the drawing be- 
havior sampled is related to body image. 

Levy’s exposition of the technique contains 
some pertinent observations. Noting that 
“some clinicians interpret each drawing as a 
projection of the body image” (9, p. 260), 
he acknowledges that this sort of projection 
does frequently occur. He nonetheless goes 
on to point out that a drawing may also rep- 
resent many other things, e.g., “a projection 
of attitudes toward someone else in the en- 
vironment, a projection of ideal self-image, or 
a result of external circumstances.” Levy re- 
marks, moreover, that “it is usually a com- 
bination of all of these.” Little wonder that 
the body image is so difficult to tease out of 
so complex and intricate a production! 

A third possible explanation for the nega- 
tive results stems from an observation by 
Machover, who writes that “a leg or an arm 
may be omitted or distorted for reasons of 
crippling anxieties, just as surely as for the 
physical crippling of those limbs” (11, p. 
263). This is a powerful and compelling ar- 
gument: if psychological and physical factors 
are reflected equally and indistinguishably 
in drawings, any attempt to differentiate the 
drawings of disabled and normal subjects 


8 Harris, D. B., personal communication, January, 
1955. 
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seems doomed to failure, however rigor- 
ously controls are established. Having made 
this attempt with particular and systematic 
attention to problems of control, sampling, 
and methodology, and without securing posi- 
tive results, we are now inclined toward this 
pessimistic point of view. The present study 
and other similar investigations have rested 
on the assumption that the physical body, 
the body image, and the drawn figure are in 
isomorphic relation. Given careful explora- 
tion, this one-to-one relationship does not 
seem to exist. 

We are not prepared, on the other hand, to 
concede that the figure-drawing technique is 
without potential value in the study of physi- 
cally disabled persons. A more individualized 
approach, focusing on the adjustment pat- 
terns of each subject, may bring greater un- 
derstanding of the widely varying modes of 
expression displayed by poliomyelitis children 
in their drawings. For example, to consider 
those children whose drawings include a clear 
depiction of their orthopedic appliance—are 
they, as Machover suggests, ‘‘basically body- 
centered individuals whose whole lives have 
been blighted by their handicap?” (11, p. 
262). Or are they children who have so com- 
pletely accepted their handicap and incor- 
porated it in their body image, that they feel 
free to represent it in their drawings? In what 
ways do such children differ from those who 
completely omit the legs in their drawings, 
and from those whose drawings show no evi- 
dence whatever of physical disability? In fu- 
ture research designed to answer such ques- 
tions, we propose systematically to explore 
these individual differences in terms of psy- 
chological processes relevant to physical dis- 
ability, e.g., defense mechanisms. We are, 
finally, in complete agreement with Mach- 
over’s observation that “organs are set in per- 
sonalities, and have no disconnected life of 
their own, and... the total reaction to 
handicap . . . is a more vital measure than 
the specific concrete projection” (11, p. 264). 


Summary 


Sets of human figure drawings—same-sex, 
opposite-sex, and self-figures—were obtained 
from 22 children in the chronic stage of 
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poliomyelitis. These drawings were studied 
by three different meihods to determine 
whether they reflected the physical disability 
of the subjects. 

1. By inspection, more than three-quarters 
of the subjects appeared to represent their 
disability, either directly or indirectly, in 
their drawings. 

2. Comparison of drawings of the disabled 
children with those of normal controls, 
equated for age, sex, and IQ, showed a non- 
significant number of significant differences 
between the two groups on a series of 55 
scoring signs. 

3. Judges experienced in the psychodiag- 
nostic use of figure drawings were unable to 
differentiate the drawings of disabled and 
normal subjects at a level better than chance. 

Various hypotheses were suggested to ac- 
count for the negative results, and a weak- 
ness in the basic assumption underlying the 
present investigation and similar studies was 
noted and discussed. Plans for future re- 
search, involving a more individualized and 
personality-centered approach to the human 
figure-drawing technique, were outlined. 


Received January 6, 1956. 
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A Comparison of Productivity and Fantasy on the 
Rorschach and Projective Movement Sequences"’ 


William H. Lundin and Boris Breiger 


Chicago State Hospital 


Clinicians frequently rely upon inference 
when analyzing sparse or unelaborated pro- 
jective records. Clinical utility increases when 
such records more clearly show psychody- 
namic content. Experience with the Projective 
Movement Sequences Test (PMS), a brief 
motion picture comprising eight semistruc- 
tured and ambiguous sequences, suggests that 
psychodynamic content can be increased by 
providing Ss with a moving stimulus (1). 

This assumption was tested by using cri- 
teria of productivity and fantasy equally 
relevant to the Rorschach and PMS. Criteria 
of productivity were: (a) number of cards 
or sequences rejected, (b) number of human 
movement responses, and (c) number of ani- 
mal movement responses. Total R was not 
used as a criterion since Rorschach and PMS 
response units are not strictly comparable. 
Structurally, PMS protocols more closely re- 
semble TAT stories than Rorschach records. 
Three measures of fantasy production were 
obtained: (a) total number of unconscious 
fantasies, indicated by archaic, bizarre, or 
highly personalized content, further categor- 
ized into (6) static objects or fragments of 
personalized content, i.e., “womb,” “broken 
arm,” “dead person,” and (c) descriptions of 
active elaborated processes, i.e., “embryo be- 


1 Based on a paper read at the 1956 meeting of 
the Midwest. Psychol. Ass., St. Louis, Mo. 

2An extended report of this study may be ob- 
tained without charge from William H. Lundin, 
Psychology Dept., Chicago State Hospital, Chicago 
34, Ill, or for a fee from the American Docu- 
mentation Institute. Order Document No. 4972 from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress, Washington 25, 
D. C., remitting in advance $1.25 for microfilm or 
$1.25 for photocopies. Make checks payable to 
Chief, Photoduplication Service, Library of Con- 
gress. 


ing born,” “arms and legs falling off,” “can- 
cer inside my body eating away.” 

The sample consisted of 40 schizophrenics 
who were given both tests. Records were ana- 
lyzed by the sign-rank test, showing results 
significant beyond the .01 level in favor of 
the PMS on the following criteria: rejection, 
human movement, animal movement. Total 
number of personalized fantasies did not dif- 
fer significantly; however, number of “active 
elaborated processes” was significantly greater 
on the PMS (.01), and number of “static 
objects” was significantly greater on the Ror- 
schach (.01). In the sample, 32.5% of Ss re- 
jected more Rorschach cards than PMS se- 
quences, 60% saw more human movement on 
the PMS, 75% saw more animal movement 
on the PMS, 62.5% had more personalized 
fantasies on the PMS, 82.5% had more static 
fantasies on the Rorschach, and 90% saw a 
greater number of elaborated fantasies on 
the PMS. Those Ss who rejected Rorschach 
cards responded to every PMS sequence. 

Several hypotheses are suggested: (ca) 
stimuli in motion lower the threshold for M 
type responses, (6) many times the Ror- 
schach reflects the S’s affective dissociation 
and gives no clear evidence for the uncon- 
scious fantasies underlying his behavior, (c) 
for most of these Ss the PMS seems to give 
the necessary “push” towards elaborating 
fantasies and revealing affect which the cli- 
nician is then able to interpret in an un- 
ambiguous manner. 


Brief Report. 
Received May 11, 1956. 
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A Continuous Performance Test of Brain Damage”’ 


H. Enger Rosvold,* Allan F. Mirsky,’ Irwin Sarason,* 
Edwin D. Bransome, Jr.,° and Lloyd H. Beck * 
Yale University 


This paper presents the results of an in- 
vestigation using a new instrument for the 
study of brain damage in human subiects. 
The design of the instrument, the Continuous 
Performance Test (CPT) was based on cer- 
tain electroencephalographic evidence which 
suggested that brain-damaged individuals 
should show inferior ability as compared with 
non-brain-damaged individuals on tasks re- 
quiring sustained attention or alertness. 

The waking EEGs of brain-damaged pa- 
tients generally show either random bursts of 
hypersynchronous (high amplitude) activity 
intruding upon the normal activity of the 
brain, or a general hypersynchrony (3, 8). 
Hypersynchronous activity is also evident in 
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tion to the following individuals and institutions for 
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ter Landmesser and Russell Fuldner and the New- 
ington Crippled Children’s Hospital, Newington, 
Connecticut; Dr. Rhoades and the New Haven 
Cerebral Palsy Clinic, New Haven, Connecticut; 
Drs. Milton Senn, William German, Janice Stevens, 
Morris Wessell, Ethelyn Klatskin, and the staff of the 
several pediatric and neurological clinics of the Grace- 
New Haven Community Hospital, New Haven, Con- 
necticut. 

2Supported in part by grants to H. Enger Ros- 
vold from the Veteran’s Administration, Contract 
VA1001-M3222 and the National Science Founda- 
tion. 

83 Now at the National Institute of Mental Health, 
Bethesda, Maryland. 

*Now at the West Haven V.A. Hospital, West 
Haven, Connecticut. 

5 Now at the College of Physicians and Surgeons, 
Columbia University, New York, N. Y. 

6 Now at the University of Michigan, Ann Arbor, 
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the recording from the brain of a sleeping 
subject (3). If hypersynchrony is associated 
with reduced vigilance or attention, as sug- 
gested by its presence during sleep, then the 
hypersynchrony of the brain-damaged pa- 
tients might also indicate reduced attention. 
For example, according to Hebb (1), hyper- 
synchrony might interfere with the sequen- 
tial firing of cell assemblies and thereby dis- 
rupt the process of attention. Other research 
strongly suggests that some such relationship 
does in fact exist. Thus, on the basis of a 
study of 75 prefrontal lobotomy cases, Levin 
states: 


Immediately following operation the patient usu- 
ally lapsed into a drowsy-anergic state from which 
he could be temporarily aroused by stimuli, only to 
lapse once again into somnolence when the stimuli 
were removed. During this drowsy-anergic state 
EEGs showed diffuse slow activity and a good deal 
of baseline oscillations of 4 to 2 per second an- 
teriorly mixed with slow rhythms of 2 to 6 per 
second frequency. As the patient emerged from the 
drowsy stage the preoperative pattern usually as- 
serted itself (2, p. 422). 


The usual measures of attention or alert- 
ness such as the digit span and digit symbol 
substitution subtests of the Wechsler-Bellevue 
have not consistently showed decline follow- 
ing brain damage (4, 5, 6, 7). In the case of 
the patient who shows intermittent bursts of 
hypersynchronous activity, and hence only 
momentary lapses in attention, the test per- 
formance may be due to the fact that the S 
can to a great extent choose his own time to 
respond, and may reorganize his attention be- 
tween momentary lapses. The lapses would 
then not affect his score to any measurable 
extent. 

In the case of the individual who shows a 
generalized hypersynchrony of the electrical 
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activity of the brain, the classical tests might 
not reflect his deficiency because they do not 
require sustained attention over a sufficiently 
long period of time. 

These considerations suggested that a test 
which would not allow the patient to choose 
his own time to respond, and which would 
require a high level of continuous attention 
over an appreciable interval of time might 
reflect a deficit that other procedures would 
miss. Accordingly, the CPT was designed to 
provide two attention tasks, labeled X and 
AX, with AX designed to be the more diffi- 
cult. This paper presents data on the CPT 
performance of comparable groups of brain- 
damaged and non-brain-damaged individuals. 

If the research demonstrates that there are 
indeed differences between brain-damaged and 
non-brain-damaged groups on this test, one of 
the questions which arises is whether or not 
the differences are large enough to be of 
diagnostic use. The data also provide a pre- 
liminary answer to this question. 


Method 


Subjects. No attempt was made in this 
study to select patients on the basis of either 
locus, extent, or type of brain damage. The 
term “brain-damaged” is here used to in- 
clude any individual for whom there was 
medical evidence (surgical notes, neurologi- 
cal examination, EEG) of brain pathology. 
Three groups were studied, each consisting 
of a brain-damaged and a non-brain-dam- 
aged subgroup. The Defective group consisted 
of 72 institutionalized feebleminded Ss, who 
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were diagnosed as either of “organic etiology” 
(the Brain-Damaged-Defective subgroup) or 
of “familial or idiopathic etiology” (the Non- 
Brain-Damaged-Defective subgroup). The 
Child group comprised 45 children from a 
children’s hospital and from various pediat- 
ric clinics who were being treated for brain 
disorders such as cerebral palsy (the Brain- 
Damaged-Child subgroup) or for non-nerv- 
ous-system disorders (the Non-Brain-Dam- 
aged-Child subgroup). The third group, 
designated the Adult group, consisted of 50 
adult Ss. Half of these (the Brain-Damaged- 
Adult subgroup) were either epileptics being 
treated at a seizure clinic or brain surgery 
patients being seen at a neurosurgery clinic. 
The remaining 24 Ss (the Non-Brain-Dam- 
aged-Adult subgroup) were either part of the 
general medical population of a hospital or 
were nonpatient Ss. 


The age and a measure of intelligence were ob- 
tained for all Ss. In addition, the length of institu- 
tionalization was obtained for the Defective group. 
The Ss in the Defective group were routinely tested 
on the Stanford-Binet upon admission to the train- 
ing school; all other Ss were tested for IQ just prior 
to presenting them with the CPT. The intelligence 
measure used for the Child group was the Stanford- 
Binet vocabulary, while the Ss in the Adult group 
were tested on either the Full Scale Wechsler-Belle- 
vue, the Verbal Scale of the Wechsler-Bellevue, or the 
20-minute modification of the Otis Self-administer- 
ing Test cf Mental Ability, Form B. Approximately 
equal numbers of Ss in each of the Adult subgroups 
were tested on each of the three measures of intelli- 
gence. The average ages and IQs of the several 
groups are presented in Table 1. The mean age of 
the Non-Brain-Damaged-Child subgroup was sig- 
nificantly higher than that of the Brain-Damaged- 


Table 1 
Mean Ages and IQs of the Subjects 











Group N Meanage SDage MeanIQ SDIQ 

Defectives 

Brain-damaged 29 27.3 8.1 63.3 7.8 

Non-brain-damaged 43 25.1 7.8 66.8 7.5 
Children 

Brain-damaged 19 9.0** 3.0 101.9 28.2 

Non-brain-damaged 26 12.3** 2.9 104.6 21.2 
Adults 

Brain-damaged 25 31.7 12.1 90.5* 15.2 

Non-brain-damaged 25 31.5 13.9 100.0* 15.0 





* Indicates a significant difference (p < .0 


5). 
** Indicates a significant difference (p < .01). 
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Fig. 1. Semischematic diagram showing the Con- 
tinuous Performance Test apparatus, a subject and 
examiner in the positions they would occupy during 
a test. 


Child subgroup (p < .01). Also, the IQ of the Non- 
Brain-Damaged-Adult subgroup was significantly 
higher than that of the Brain-Damaged-Adult sub- 
group (p< .05). There were no other significant 
differences between the various paired subgroups 
with respect to age or IQ. The mean length of in- 
stitutionalization for the Brain-Damaged-Defective 
subgroup was 11.4 years, for the Non-Brain-Dam- 
aged-Defective subgroup, 9.3 years. This difference 
was not significant. 


Apparatus. The testing instrument is illus- 
trated in Fig. 1. It consisted essentially of a 
revolving drum on which two series, each of 
31 letters, were mounted side by side. 


One series served for the X task, the other for 
AX task. The drum revolved slowly, approximately 
twice a minute, in a boxlike case equipped with a 
visor through which the S was required to look. The 
E shifted the task from X to AX by changing the 
horizontal position of the visor. The letters were 
illuminated briefly one at a time. The illumination 
was provided by a bank of five 0.5-watt neon bulbs 
mounted inside the case, fired by means of the dis- 
charging of an 8-microfarad condenser. The S per- 
formed by pressing a response key when certain 
“correct” letters appeared. In the X task, every X 
was a correct letter; in the AX task, every X fol- 
lowing an A was a correct letter. The letters came 
into view at approximately 0.92-sec. intervals. A re- 
sponse was scored correct if the response key was 
pressed within 0.69 seconds after the brief illumina- 
tion of the correct letter. Following this interval, 
there was a period of approximately 0.23 seconds 
before the next letter was illuminated. Correct and 
incorrect responses were recorded automatically on 
a set of counters facing the back of the machine 
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where E sat. The X task was always presented first 
and included eight X’s in the series of thirty-one let- 
ters. The responses occurring over the first two revo- 
lutions of the drum served as practice and were not 
counted in the S’s score. The Ss were then given 
either a 5-minute test (10 revolutions) in which 
there were 80 possible correct X responses, or a 10- 
minute test (20 revolutions) in which there were 
160 possible correct responses. Following a 2-minute 
rest period, the visor was moved by E so that the 
AX series was in view. There were six correct AX 
sequences among the thirty-one letters of the AX 
series. As with the X task, the responses occurring 
during the first two drum revolutions served as 
practice. Then, the Ss received either a 5-minute 
test in which there were 60 possible correct AX re- 
sponses, or a 10-minute test, in which there were 
120 possible correct. 


Procedure. No attempt was made to keep 
the instructions completely identical from S$ 
to S; the typical form, however, did not vary 
greatly from this content: 


“Do you see this viewer? When the machine starts, 
you will see letters appearing one at a time. Your 
job is to press this key every time you see an X. 
Don’t press it for any other letter and always press 
it when you see an X. Press the key down for a bit 
when you see an X; don’t hit it too quickly. Try 
holding your eyes different distances from the viewer 
until you find the one that is most comfortable for 
you. Remember, press the key every time you see an 
X, but not for any other letter.” Those Ss who had 
difficulty in understanding were given further in- 
struction of the same sort. 


With the X task and the two-minute rest 


period completed, the S was instructed as 
follows: 


“Your job in this next part is again to press the 
key when you see an X, but this time, only if the 
X follows right after an A. That is, when you see 
the series A—X— press the key on the X. Don’t 
press the key for any other letter following an A 
except X, and always press it then. Remember now, 
when you see the letter A, get set; if an X comes 
right after it, press the key.” Additional instruction 
was given to those who had difficulty in under- 
standing. 








Except for the children, all Ss were tested 
for 10 minutes on each of the two tasks. With 
children, the time was reduced to 5 minutes 
on each task, and in addition, the instructions 
were modified to some extent, presenting the 
task as a game. For the children under the 
age of 10, pretesting was carried out to make 
certain that they could discriminate letters, 
and to see whether they could follow the in- 
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structions by requiring answers to two spe- 
cial test sheets, one containing Xs among 
other letters, the other, As and Xs among 
other letters. Those who failed this pretest 
were not tested further. No S was tested who 
could not perform the minimal motor tasks 
of holding his head up unsupported and press- 
ing the response key. Since the testing was 
conducted at many different clinics and hos- 
pitals, it was not feasible to control exactly 
the amount of illumination in the testing 
room. The ordinary lighting of the room was 
used without any modification; care was 
taken however, to insure that there was no 
glare on the visor of the testing apparatus. 
Reliability. Two measures of the reliability 
of the CPT were made, using Ss of the De- 
fective group. Odd-even reliability was deter- 
mined from the scores of 21 non-brain-dam- 
aged Ss, by comparing the subtotal of correct 
responses on the 10 even revolutions of the 
drum with the subtotal on the 10 odd revolu- 
tions. The Pearson r for the X task was 0.88, 
for the AX task 0.86. To determine test-re- 
test reliability, 21 brain-damaged and 22 non- 
brain-damaged Ss were retested from four to 
seven weeks after the initial testing. The test- 
retest r for the number of correct responses 
of the brain-damaged group on X was 0.90, 
for AX, 0.79. The test-retest r for the num- 
ber of correct responses of the non-brain- 
damaged group on X was 0.88, for AX, 0.74. 
CPT scores. The X and AX responses of 
each S$ were scored in two ways. The absolute 
percentage correct was defined as the number 


of correct responses the S made divided by 
the number of correct responses (X 100) pos- 
sible in the time allowed. The relative per- 
centage correct was defined as the number of 
correct responses (xX 100) the S$ made di- 
vided by his number of attempts (both cor- 
rect and incorrect) during the time allowed. 
In the event that the two scores differed 
markedly, depending on the S’s pattern of re- 
sponding, the two measures provided a more 
complete description of his performance than 
would either of the two taken singly. 


Results 


CPT scores. Table 2 presents the means 
and standard deviations of the absolute per- 
centage of possible correct responses on the 
X and AX tasks for the six subgroups. Since 
there was a significant age difference between 
the Child subgroups and a significant IQ dif- 
ference between the Adult subgroups (Table 
1), the differences between the paired sub- 
group means on X and AX in these two 
groups were evaluated by means of an analy- 
sis of covariance. The means of the Child 
group in Table 2 have been adjusted for re- 
gression on age and the means of the Adult 
group have been adjusted for regression on 
IQ. The covariance procedure was corrected 
for heterogeneity of variance between the 
subgroups when the analysis required it. The 
mean differences in the Defective group were 
evaluated by means of the ¢ test. 

The differences between the subgroup means 
were highly significant on both the X and 


Table 2 


Adjusted Means and Standard Deviations for Child and Adult Groups, Means and Standard Deviations for 
Defective Groups, X and AX Absolute Percentage Correct Measures 

















Brain-damaged Non-brain-damaged 
Adj. Adj. 
Group N mean SD N mean SD p diff. 

Children 

xX 19 41.61 28.39 26 82.57 14.65 <.0005 

AX 19 46.46 28.20 26 79.41 17.44 <.0005 
Adults 

xX 25 78.53 28.26 25 88.00 17.21 >.30 

AX 23 61.40 26.66 23 78.77 15.33 <.01 
Defectives 

X 29 46.44 32.02 43 67.86 25.62 <.01 

AX 29 32.30 25.79 43 54.74 20.02 <.0005 
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Table 3 


Significance Levels of Differences Between X and AX 
Scores, for Relative and Absolute Measures 
of Performance, All Groups 
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Table 4 


Significance Levels of Differences Between Adjusted 
AX Mean Scores of Brain-Damaged and Non- 
Brain-Damaged Subgroups, All Groups 














Non- 
Brain-«/amaged brain-damaged 
Group Absolute Relative Absolute Relative 
Children >.70 >.30 >.50 >.70 
Adults <.001 <.01 <.01 <.05 
Defectives <.01 <.001 <.001 <.001 


Performance measure 





Group Absolute Relative 
Children >.20 <.05 
Adults <.01 >.05 
Defectives <.05 <.05 





AX tasks in the Defective group (p < .01 
and p < .005, respectively) and also in the 
Child group (~ < .0005 in each case). In the 
Adult group, the brain-damaged subgroup 
was significantly inferior to its controls on 
AX (p< .01), but not on X (p > .30). 

When the relative percentage scores of the 
paired Child and Adult subgroups were com- 
pared by means of an analysis of covariance, 
and those of the Defective group by means 
of the ¢ test, the paired subgroup differences 
were all in the same direction and of approxi- 
mately the same magnitude as those reported 
in Table 2. The levels of significance were in 
no case less than the corresponding levels re- 
ported in Table 2. The only difference was 
that the relative scores on the X task dis- 
criminated significantly (p< .05) between 
the Adult subgroups, whereas the absolute 
scores on the X task (Table 2) did not. 

In order to determine if the AX task was 


in fact more difficult than the X task, the 
mean X score was compared with the mean 
AX score for each of the six subgroups. 
Table 3 presents the significance levels from 
the ¢ test of the differences between X and 
AX scores for both the relative and absolute 
measures of performance in all six subgroups. 
Each of the Defective and Adult subgroups 
performed significantly worse (at p < .05 or 
less) on the AX than on the X task. With the 
children, none of the differences between X 
and AX was significant. To determine if the 
brain-damaged subgroups had relatively more 
difficulty on AX (i.e., independent of their 
performance on X) than the non-brain-dam- 
aged subgroups, an analysis of covariance 
was run on the AX scores of the paired brain- 
damaged and non-brain-damaged subgroups 
with X scores as the covariate. Table 4, 
which presents the results of the analysis, 
shows that in terms of the absolute measures 


Table 5 


Percentages of Correct Identification of Diagnostic Category by CPT Scores, All Groups 


























Group 
Defectives Children Adults 
Brain- Non-brain- Brain- Non-brain- Brain- Non-brain- 
Subgroup damaged damaged damaged damaged damaged damaged 
N 29 43 19 26 25 25 
X—R as 67.4** 84.2*** 76.9*** 68.0* 68.0* 
X—A 62.0 58.1 89.5*** 76,9*** 64.0 64.0 
AX—R 72.4 65.1* 84.2*** 76.9*** 66.7* 66.7* 
AX—A 75.9*** 67.4** 84.2*** 76.9*** 65.0T 65.0f 
*p < .05 
*> < 01 
> < .005 


Note.—-values refer to the probability that the percentage differs from chance (50%). Nonstarred percentages do not differ 


significantly from chance. 








of performance, the Brain-Damaged-Adult 
and -Defective subgroups had significantly 
greater difficulty on the AX task than their 
respective nondamaged subgroups, even when 
the paired subgroups were adjusted to equal 
levels of performance on the X task. Simi- 
larly, in terms of the relative measures of 
performance, the Brain-Damaged-Child and 
-Defective subgroups had significantly greater 
difficulty on AX than their respective non- 
damaged subgroups. 

Correct identification of subgroup. Table 5 
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presents the percentages of correct identifica- 
tion of diagnostic category of all the sub- 
groups on all measures yielded by the CPT. 
These were determined for each subgroup on 
each CPT measure from the over-all median 
of scores of a group on that measure. For ex- 
ample, considering the X scores in the De- 
fective group, the percentage of brain-dam- 
aged Ss scoring below the combined Defec- 
tive group median defines the percentage of 
correct identification of brain-damaged defec- 
tives on the X task. Similarly, the percent- 
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Fig. 2. Frequency distributions of subjects by percentage correct score intervals, all groups, all CPT 
measures. Black columns represent frequencies of brain-damaged subjects, white columns, frequencies 
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age of non-brain-damaged defectives scoring 
above the over-all median defines the per- 
centage of correct identification of non-brain- 
damaged defectives on the X task. These 
percentages were computed to provide an 
estimate of the potential usefulness of the 
several scores o! the CPT as diagnostic indi- 
cators in various clinical groups. In Table 5, 
X-R and AX-R refer to the relative meas- 
ures of performance on these tasks; X-A and 
AX-A refer to the absolute measures of per- 
formance. The relative measures provided 
more significant percentages of correct identi- 
fication than did the absolute measures; all 
of the six X-R and all of the six AX-R sub- 
group percentages were significant at p < .05. 
The Child group had more significant per- 
centages of correct identification than any 
other (eight of eight) all of which were sig- 
nificant at p < .005. There were equal num- 
bers of significant percentages of correct 
identification among the non-brain-damaged 
and brain-damaged subgroups; however, these 
percentages tended to be higher among the 
brain-damaged subgroups. 

Figure 2 presents for each group and CPT 
measure, the distribution by percentage score 
of brain-damaged and non-brain-damaged Ss 
(uncorrected for age or IQ differences be- 
tween subgroups). Figure 2 emphasizes in 
general the great range in scores achieved by 
brain-damaged Ss as compared with non- 
brain-damaged Ss. The scores of the non- 
brain-damaged Ss are usually concentrated 
near the upper end of the distribution; in 
fact, the modal score for non-brain-damaged 
Ss is in the 91 to 100% interval in every case 
except for the AX-A measure in the Defective 
group. In contrast with this, the scores of the 
brain-damaged Ss span the entire distribu- 
tion, or the entire distribution excepting one 
score interval, in every case. 


Discussion 


The results of this study indicate that brain- 
damaged individuals perform poorly relative 
to non-brain-damaged controls on a task re- 
quiring continuous attention. The results also 
indicate that on a task designed to require 
even more sustained attention the brain-dam- 
aged individuals perform even more poorly. 
The score distributions of the various sub- 
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groups (Figure 2) and the percentages of in- 
dividuals correctly identified by diagnostic 
category (Table 5) suggest that the CPT is 
sufficiently sensitive to the effects of brain 
damage so that it might prove useful clini- 
cally. The data of Table 5 suggest that the 
relative measures are more discriminating 
than the absolute measures, that the test is 
somewhat more reliable in reflecting the 
presence than the absence of brain damage 
and that it is most discriminating when ap- 
plied to the children of this study. 

These findings are of course preliminary. 
Use of the CPT in the clinic must await 
standardization on a variety of diagnostic 
groups, to ascertain whether brain pathology 
only and which types of brain pathology 
lead to deficient performance. For example, 
some data which have not been reported here 
indicate that schizophrenics perform no dif- 
ferently from normals of comparable age and 
IQ. More information concerning other groups 
is required. Standardization on a variety of 
diagnostic groups is also necessary in order 
to have a reliable diagnostic score for each 
subgroup. Thus, in the present study, cutoff 
points which included most brain-damaged 
children missed many brain-damaged adults 
(see Figure 2). 

Diagnostic considerations aside, the ques- 
tion remains as to whether the impaired per- 
formance of the brain-damaged Ss on the 
CPT is in fact due to an impairment of at- 
tention, whether in the form of momentary 
lapses, or of a lowering of a general level. 
Clearly, the available information does not 
rule out alternate interpretations of the defi- 
cit. To make certain that it is attention that 
is impaired would require more extensive in- 
formation about the S’s test behavior than 
the present instrument provides. It is clear, 
furthermore, that the available information 
does not permit the conclusion that the im- 
paired CPT performance is indeed related to 
hypersynchrony as evident in the EEG. 
Nonetheless, the results of this study do lend 
support to these hypotheses suggesting that it 
might now be profitable to make a more di- 
rect test of the original formulation. This test 
would be provided by simultaneously measur- 
ing CPT performance and brain activity to 
determine whether errors on the CPT are 











350 H. E. Rosvold, A. F. Mirsky, I. Sarason, E. D. Bransome, and L. H. Beck 


coincident in time with hypersynchronous 
patterns on the EEG and whether there is a 
correlation between degree of deficit on the 
CPT and degree of hypersynchrony. 


Summary 


1. The Continuous Performance Test, a 
procedure for the detection and study of brain 
damage in humans, is described. 

2. Three groups of Ss, each including a 
brain-damaged and a non-brain-damaged sub- 
group, were tested on this procedure. 

3. The brain-damaged subgroups were sig- 
nificantly inferior to their non-brain-damaged 
controls on the measures yielded by the CPT, 
and these differences were increased when the 
difficulty of the task was increased. 

4. The CPT is sufficiently reliable and 
yields sufficiently large differences between 
subgroups to suggest that it might ultimately 
prove useful as a clinical instrument for the 
diagnosis of brain damage. 

5. An interpretation of the inferior per- 
formance of the brain-damaged Ss was offered 
in terms of impairment in attention or alert- 


ness and suggestions were made about future 
research relating cerebral events and CPT 
performance. 


Received January 13, 1956. 
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Stimulus Structure as a Determinant of the 
Perceptual Response’ 


M. Powell Lawton 


Veterans Administration Hospital, Providence, R. I. 


The role of formal stimulus characteristics 
is acknowledged to be of importance in the 
interpretation of projective test data. Clarifi- 
cation of the meaning of projective test re- 
sponses has been sought through study of 
stimulus qualities in a number of areas (1, 2, 
6, 8, 12, 14, 15). 

With respect to the Rorschach test, the role 
of form in the production of a response has 
received surprisingly little attention. The con- 
cept of the popular response implies that for- 
mal blot characteristics of certain areas of 
the Rorschach tend to call forth a particular 
response. Ranzoni, Grant, and Ives (10) ob- 
tained a set of norms for an adolescent popu- 
lation specifying which cards were most likely 
to elicit responses using a particular location, 
determinant, and content. Their study was 
not concerned primarily with form, however, 
and made no attempt to vary the forms of 
the Rorschach stimuli or to place them in an 
order of complexity. Meer’s study (9) estab- 
lished a tentative order of organizational diffi- 
culty among cards, but did not attempt to 
specify the factors believed to be responsible 
for stimulus ease or difficulty. Stein’s (11) 
tachistoscopic exposure of the cards demon- 
strated that perception was less adequate un- 
der impoverished stimulus conditions but 
again, the stimulus condition as a whole was 


1 Adapted from a dissertation submitted in partial 
fulfillment of the requirements for the degree of Doc- 
tor of Philosophy in the Faculty of Philosophy, Co- 
lumbia University, 1952. The author wishes to thank 
Dr. E. J. Shoben, Jr., Dr. Helen Waiker, and Dr. 
Laurance Shaffer, members of his committee, for 
their help, and also Dr. Bernard Locke, Chief Psy- 
chologist, New York VA Hospital, for making pos- 
sible the gathering of the data. 


varied, with no attempt to focus on form as 
the variable for study. 

The present study investigated by means 
of a non-Rorschach perceptual task the gen- 
eral hypothesis that stimulus ambiguity is an 
important determinant of perceptual ade- 
quacy. Degree of maladjustment was a sec- 
ond variable studied in line with Forer’s (5) 
suggestion that degree of personality integra- 
tion is a determining factor in the differential 
adequacy with which an individual perceives 
in situations of varying ambiguity. 

It was hoped that variation of the form of 
the stimulus, isolated from as many other 
stimulus characteristics as possible, would 
provide a general basis upon which to in- 
terpret individual projective records, though 
the limits of applicability to any specific test 
are fully realized. As in Stein’s (11) study, 
response characteristics were evaluated in 
terms of perceptual adequacy, defined so as 
to include both formal and content charac- 
teristics of the response. The specific hypothe- 
sis was that both increasing stimulus am- 
biguity and increasing degree of pathology 
would result in decreased perceptual ade- 
quacy. 

Procedure 


In order to limit investigation as much as 
possible to form alone, sixty figures were ran- 
domly cut from black paper and mounted on 
white cards, leaving only the blackness of the 
figure as a confounding factor. Seventeen psy- 
chologists rated each figure as most, least, or 
moderately structured, using as their criterion 
of structuredness the relative ease with which 
each figure suggested a percept. From these 
judgments five figures were clearly chosen as 
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the most structured (Structure I), five as 
moderately structured (Structure II), and five 
as least structured (Structure III). The 15 
figures were arranged in random order for 
standardized presentation to the subjects. 
This “subjective consensus” method of defin- 
ing stimulus ambiguity, though it has the 
disadvantage of risking bias through the use 
of only psychologists as judges, affords a 
definition of ambiguity in psychological, 
rather than physical, terms. The psychologi- 
cal definition was deemed more appropriate 
to the present study. 

Thirty Ss were chosen from each of three 
nosological groups: neurotic, latent schizo- 
phrenic, and hospitalized schizophrenic, as 
diagnosed either by their outpatient thera- 
pists or ward physicians. By using this or- 
dinal scale of degree of maladjustment, it was 
hoped that unreliability in ratings of malad- 
justment could be avoided. The three groups 
were matched for age and education. Analy- 
sis of variance and Bartlett’s test showed no 
reason to suspect that either means or vari- 
ances differed significantly. The mean age of 
the entire group was 29.8 years, mean edu- 
cational level 11.4 years. 

The cards were presented in standard po- 
sition and Ss were instructed to tell the ex- 
aminer what the whole figure could look like. 
Turning and detail responses were discour- 
aged, though accepted if S could not respond 
in the requested manner. Only the first re- 
sponse to each figure was used. Through 
judicious urging only five rejections out of a 
possible 1,350 responses occurred. The S was 
asked to describe each response further. This 
was the examiner’s only question, other than 
clarification of the areas used. 


Table 1 


Number of Patients Receiving All “J” Ratings 
on Form Level 








Stimulus Ambiguity 








Group I II Ill 
Neurotics 28 19 14 
Latent schizophrenics 25 13 8 
Hospitalized schizophrenics 17 3 1 





The following categories were employed in 
evaluating the perceptual quality of the re- 
sponses: 

Form Level: a four-point rating scale measuring 
judged goodness of fit of percept to stimulus. Scale 
points were named J. adequate form, 2. vague form, 
3. poor form, and 4. very poor form. 

Structural Peculiarities: (a) Fabulation was a re- 
sponse which added a great deal more to the percept 
than could reasonably be assumed from the stimu- 
lus (“Miss Olsen’s profile,” “a person who died of 
pneumonia.”) (b) Incongruity was a percept whose 
parts do not belong together (“a person with a 
meat cleaver attached to his head,” “a rabbit—here’s 
the opening in his shell.”) 

Sex Response: Any overt genital or anal response. 


Other content categories were scored, but 
occurred infrequently and could not be scored 
reliably. A Total Pathology rating scale was 
also used, but because of its high correlation 
with the Form Level scale, added little fur- 
ther information. 

Each response was typed and coded indi- 
vidually and the ninety responses to each fig- 
ure were scored consecutively by the writer. 
One-third of the total responses were sub- 
mitted for a reliability check to two other 


Table 2 


Number of Patients Without Structural Peculiarities, With Cochran’s Q Test and 
Combined Probability Measure 














Stimulus ambiguity 
Order Combined 
Group I i im Q ? probability probability ! 
Neurotics 29 23 26 2.36 308 1/6 114 
Latent schizophrenics 21 19 15 3.10 214 1/6 .080 
Hospitalized schizophrenics 20 19 18 48 .794 1/6 .295 
All groups 70 6 SD 5.03 082 1/6 031 
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psychologists.? Through a method described 
in detail elsewhere (7), scoring reliability was 
determined and was deemed good enough to 
permit using the writer’s scoring in the 
evaluation of the data. 


Results 


Table 1 shows the number of patients all 
of whose responses received Form Level scores 
of “1,” by clinical group and by stimulus 
structure. The number of patients receiving 
all “7” ratings decreased regularly both as 
stimuli became more ambiguous and as pa- 
thology increased. A nonparametric signifi- 
cance test was performed as follows. For nine 
cells there are 9! possible arrangements, and 
of these 9! arrangements only 42 can be 
found such that in every row the frequencies 
decrease from left to right and in every 
column decrease from top to bottom. Ac- 
cordingly, the probability of obtaining by 
chance alone data showing this regular de- 
crease in Form Level with increasingly am- 
biguity and with increasing degree of illness 
is 42/9!, or .0001156. 

Table 1 suggested the possibility that the 
rate of decline in Form Level as stimuli be- 
come more ambiguous might be greater as 
psychopathology increases. This supposition 
was tested in the following manner. A rate of 
decline score was obtained for each S$ by sub- 
tracting the sum of his Form Level scores for 
Structure I from the sum of his Form Level 
scores for Structure III and dividing this dif- 
ference by the sum of Structure I and Struc- 
ture III Form Level sums. The mean rate of 
decline score for neurotics was .0759, for 
latent schizophrenics .1223, and for hospital- 
ized schizophrenics .1935. An analysis of 


2 Dr. Gerald Bauman and Dr. Melvin Kornreich 
most kindly acted as judges. 
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Table 3 


Number of Patients Who Give No 
Structural Peculiarity 











Group Number* 
Neurotics 25 
Latent schizophrenics 11 
Hospitalized schizophrenics 12 





* Chi square = 7.61; 9 < .05. 


variance resulted in an F value of 4.95, sig- 
nificant at the .01 level for 2 and 87 degrees 
of freedom. 

The incidence of Structural Peculiarities is 
shown in Table 2. Columns I, II, and III in- 
dicate the number of patients who did not re- 
spond with a Structural Peculiarity to any 
one of the five figures of each structural 
level. In order to test the hypothesis that 
incidence of Structural Peculiarities is a 
function of stimulus ambiguity, Cochran’s Q 
test (3), a variant of chi square that does 
not require independence of observations, was 
used. None of the probabilities (Column six, 
Table 2) associated with Q values reaches 
statistical significance, but when account is 
taken of the fact that the predicted order 
across rows was obtained in each case (13), 
the probability value for the entire sample is 
seen to be .031. The usual chi-square test ap- 
plied to the relationship between degree of 
maladjustment and incidence of Structural 
Peculiarities (Table 3) gave a chi-square 
value of 7.61, associated with a probability 
value less than .05. Inspection reveals that 
the significance of this measure is derived 
largely from the relative absence of Struc- 
tural Peculiarities in the neurotic group. 

Table 4 indicates the number of patients 
showing no Sex responses to cards of differ- 
ent stimulus structures. Again using Cochran’s 


Table 4 


Number of Patients Without Sex Responses, With Cochran’s Q Test and 
Combined Probability Measure 














Stimulus ambiguity 
Order Combined 
I II Ill Q p probability probability 
All groups 86 84 81 3.80 142 1/6 053 
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Table 5 
Number of Patients Who Give No 
Sex Response 
Group Number* 
Neurotics 29 
Latent schizophrenics 22 
Hospitalized schizophrenics 25 





* Chi square = .98; p > .05. 


Q test and combining probabilities, the asso- 
ciation between stimulus structure and inci- 
dence of Sex responses is seen to be of border- 
line significance (p = .053). 

Table 5 indicates the number of patients 
giving no Sex responses. The chi-square value 
of .98 is associated with a probability greater 
than .05. Accordingly the hypothesis that 
incidence of Sex responses is a function of 
degree of maladjustment must be rejected. 


Discussion 


Perceptual distortion as measured by form 
accuracy is thus seen to be a function of both 
stimulus ambiguity and degree of maladjust- 
ment. On the other hand, perceptual distor- 
tion as measured by indices relating to con- 
tent and the organization of content show a 
highly tenuous, if at all significant, relation- 
ship to stimulus ambiguity and to degree of 
maladjustment. 

It appears clear, then, that it is important 
to consider stimulus characteristics in the in- 
terpretation of a projective response. In the 
Rorschach, for example, a response judged to 
be of poor form quality would have different 
meaning depending upon whether it occurred 
to a highly structured area or to a relatively 
amorphous area. As indicated in Table 1, a 
poor form response to the most ambiguous 
stimuli would give less information about the 
presence of a schizophrenic process in an in- 
dividual than would a poor form response to 
a highly structured stimulus: the original data 
show that four neurotics gave ten definitely 
poor form responses to the less well-struc- 
tured cards, but none to the best structured. 
It can be inferred that any response of less 
than adequate form level to a popular area of 
the Rorschach cards should make the in- 
terpreter suspicious of a severe disturbance, 
whereas an accumulation of F — responses on 


less well-structured areas would be required 
to give the same impression. The present re- 
sults would tend to confirm Meer’s (9) sup- 
position that an important factor in deter- 
mining Rorschach card difficulty is stimulus 
ambiguity. A problem for further research 
might involve the development of a more 
discriminating Rorschach F+% than is 
presently used by weighting more heavily 
those F — responses that occur to highly 
structured areas. 

On the other hand, the equivocal results 
obtained in the attempt to relate Structural 
Peculiarities and Sex responses to stimulus 
structure make interpretation somewhat diffi- 
cult. It may be that there is actually a 
greater contribution of internal factors than 
of stimulus factors in the production of these 
content-related distortions, though the pres- 
ent study does not answer this question. This 
supposition is reasonable if one considers that 
standards of what is appropriate in matters 
of content are largely internal, in distinction 
to the Form Level dimension, where the 
model against which the appropriateness of 
the S’s association is tested is physically 
present in the stimulus. 

The differential rates of decline in Form 
Level with increasing ambiguity among the 
three clinical groups offers experimental verifi- 
cation of the observation that the more mal- 
adjusted individuals “go to pieces” more 
easily than the less maladjusted. Not only is 
there a difference in adaptive ability, but this 
difference becomes wider as the external situa- 
tion becomes more demanding, in the sense of 
forcing the individual to provide structure to 
an ambiguous field from within himself. Such 
a finding would suggest that the best dif- 
ferentiation among groups according to de- 
gree of maladjustment would come from rat- 
ing the Form Level of the more ambiguous 
cards. This supposition was tested by Feld- 
man, Gursslin, Kaplan, and Sharlock (4) 
who found that an F + % which excluded 
popular responses from consideration was 
more discriminating between normals and 
neurotics than Beck’s or Hertz’s F + %. 


Summary 


As a means of exploring the Form Level 
dimension in projective perception, a method 
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was devised to test the hypothesis that both 
increasing stimulus ambiguity and increasing 
personality maladjustment would have an ad- 
verse effect upon perceptual adequacy. Black 
paper cutouts of three ranked degrees of am- 
biguity were presented in Rorschach fashion 
to three groups of patients representing three 
degrees of maladjustment. It was found that 
both greater stimulus ambiguity and greater 
personality maladjustment were associated 
with lesser perceptual adequacy, as measured 
by Form Level. Perceptual adequacy as meas- 
ured by several indices involving response 
content and organization of content was seen 
to have an equivocal relationship to stimulus 
ambiguity and to personality maladjustment. 
It was concluded that stimulus characteristics 
are often important determinants of projec- 
tive responses and that consideration of form 
ambiguity is necessary to the best interpreta- 
tion of projective records. 


Received November 21, 1955. 
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Manifest Anxiety and the Palmar Perspiration Index’ 


Allen D. Calvin, F. J. McGuigan, Sybil Tyrrell, 
and Marshall Soyars 
Hollins College 


During the last few years the Taylor A 
scale has become an increasingly important 
measure of manifest anxiety. During the same 
period Mowrer and others from the Univer- 
sity of Illinois have been developing the 
Palmar Perspiration Index (PPI) as a meas- 
ure of psychological tension. The present 
study is an attempt to investigate the rela- 
tionship between the A scale and the PPI. 

The A scale and the higher form of the 
Otis were given to 54 female undergraduate 
students. In order to create a stress situation 
the Ss were called in for individual experi- 
mental sessions and given the Wechsler digit 
span. Immediately after completing the digit 
span a PPI was obtained. After approximately 
three months Ss were asked to fill out a six- 
point self-rating anxiety scale. 

Product-moment correlation coefficients were 
obtained between the following: A scale and 
PPI — .01; A scale and digit span — .05; 
digit span and PPI — .16; Otis and A scale 
— .02; Otis and PPI .22. None of these was 
significant. A biserial correlation of .35 (p 
< .05) was obtained between self-ratings and 
A scale scores, and a biserial of .17 (not sig- 


1An extended report of this study may be ob- 
tained without charge from Allen D. Calvin, Hollins 
College, Va., or for a fee from the American Docu- 
mentation Institute. Order Document No. 4849 from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress, Washington 25, 
D. C., remitting in advance $1.25 for microfilm or 
$1.25 for photocopies. Make checks payable to 
Chief, Photoduplication Service, Library of Congress. 


nificant) was obtained between self-ratings 
and the PPI. 

The failure to find a correlation between 
A-scale scores and PPI scores was unex- 
pected since at first glance both seem to be 
measuring somewhat the same variable. How- 
ever Beam (1) in a study published after the 
completion of our experiment, reports that he 
too found no significant relationship between 
the PPI and A scale. He points out that his 
Taylor scores fell within the middle range 
and were fairly homogeneous. Our A-scale 
scores had a mean of 16.9 with an SD of 
6.9, indicating considerable scatter within our 
group. It seems highly unlikely that our fail- 
ure to obtain a significant relationship be- 
tween A scale and PPI scores can be ascribed 
to too much homogeneity within our group. 

On the basis of our findings and other 
studies of anxiety the following hypothesis is 
proposed: The A scale and the PPI measure 
two different facets of behavior. Where anx- 
iety is of a temporary “ego-involved” situa- 
tional nature, the appropriate measure of 
drive is the PPI. Conversely, where anxiety 
is a stable permanent personality character- 
istic of the individual the appropriate meas- 
ure of drive is the A scale. 


Brief Report. 
Received April 26, 1956. 
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Effective Intelligence as Measured by an Unstructured 
Sentence-Completion Technique’ 


George E. Copple 
Vanderbilt University 


Numerous varieties of test material and 
test procedure have been used in assessing 
the intellectual functioning of children. Al- 
most all these procedures have involved rather 
strict structuring, and this structuring has 
ordinarily been considered a merit of the 
procedure. Perhaps because of this emphasis 
on easily scored, “right or wrong” answers, 
the possibilities of using a sentence-comple- 
tion technique to evaluate intelligence have 
been largely overlooked. The present study 
will describe a sentence-completion technique 
which has been developed as a measure of 
“effective intelligence” and will present data 
on its usefulness. 


Earlier Uses of Sentence Completions 


Beginning with Ebbinghaus (1), and best 
known from its inclusion on the Stanford- 
Binet (2), one type of sentence completion 
has long been used as an indicator of intelli- 
gence. In this type, most of the sentence is 
given, with only a connective word omitted. 
Acceptable choices for this omitted word are 
quite limited and an answer is either “right” 
or “wrong.” M. R. Trabue (3) used a simi- 
lar method in a large-scale study. He called 
his test “language scales” and it is obvious 
that knowledge of word meanings and lan- 
guage usage played the greatest part in de- 
termining a successful performance. 

The other, and more recent, use of sentence 
completions has been to evaluate personality 
or adjustment. Every survey of projective 
techniques is likely to have a section or a 


1 This article represents a revision of a paper pre- 
sented at a meeting of the Southern Society for Phi- 
losophy and Psychology in Atlanta, Georgia, on 
April 16, 1954. 
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chapter on sentence-completion techniques. 
These techniques are usually for adults, they 
consist of “loaded” stems intended to tap 
specific areas of adjustment (“My greatest 
weakness is...” or “I think that most 
mothers . . .”), and the responses are evalu- 
ated for content, particularly dynamic con- 
tent. The level of ability shown by the com- 
pletions and the range and depth of interests 
are ignored. 

The use of the sentence-completion tech- 
nique to be described in the present paper is 
different from these other uses reported in 
the literature in general purpose, technique 
of administration, and rationale. 


Rationale 


Most intelligence-test items demand little 
that is creative or even integrative of the per- 
son being tested. They provide a task or a 
problem that has a narrow range of correct 
answers. They provide a great deal of struc- 
turing and often call for a single word or 
phrase or brief explanation. Typical items 
are: “What is the thing to do if you are on 
your way to school and see that you are in 
danger of being late?” “What is the color of 
rubies?” “How are a coat and a dress alike 
or the same?” “Sky is to blue as grass is to 
.’ Undoubtedly items of this kind give 
some valid indication of a subject’s intelli- 
gence. But when a test is composed exclu- 
sively or mostly of such items, some impor- 
tant aspects of intelligence are left out. The 
individual who can readily place Brazil in 
South America, Egypt in Africa, or India in 
Asia, may not be able to respond successfully 
if asked instead to name one country in each 
of these continents. With still less structur- 
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ing, as for example in responding to the in- 
struction to give the best sentence comple- 
tion he can to the stimulus “In other coun- 
tries . . .,” he may give us something inept 
and vague such as “things are different” or 
“they are foreigners.” One limitation of virtu- 
ally all intelligence tests is just this failure 
to let the subject do enough—the failure to 
let him show us what he can do and does do 
when the stage is not too completely set. 

An example may be useful. Here are two 
nine-year-old boys who are completing the 
stimulus-stem “A piece of string. . . .” One 
completes it “is long.” The other supplies 
“can be used to make a bow and arrow.” 
This latter is obviously a far superior an- 
swer. It is most unlikely, however, that the 
first boy was ignorant of the fact which the 
second boy’s completion utilized. On a more 
conventional test item calling for this infor- 
mation, such as, “A bow is made of 
and »’ both boys would probably be 
able to answer correctly. But does not the 
second boy’s method of completing the sen- 
tence show something in addition to the pos- 
session of information? Does not this “some- 
thing” seem to be basic to many types of 
school and nonschool performance that we 
label “intelligence”? 








Administration of the Test 


The Copple Sentence Completion Test is 
composed of 60 items, only 30 of which are 
given to any one subject. Level A, consisting 
of items 1-30, is for children 5-8 inclusive. 
Level B, consisting of items 16-45, is for chil- 
dren 9-12 inclusive. Level C, with items 31- 
60, is for children 13 and older, and for 
adults. Items at the various levels differ some- 
what in difficulty, but this is not very easy to 
insure and is of minor importance. The scor- 
ing standards differ for the different levels of 
the test, so that even if an intrinsically easy 
item appears on the higher level test, it is 
more strictly scored and a better perform- 
ance is required for a comparable rating. 

Administration is always individual, with oral re- 
sponses by the subject, which are recorded by the 
examiner. The procedure enables young children or 
illiterates to respond as fully as they are able, with- 
out penalizing or limiting them in any way. It per- 
mits people in all age groups to use natural con- 
versational completions without regard to avoiding 


words they cannot spell. At the other extreme, it 
discourages “literary” completions. Experience seems 
to show that it is unnecessary to lim’+ the time a 
subject can use in framing his responses. As a rule, 
the longer the delay, the poorer the response (ex- 
cept for extremely brief latencies). 

In introducing the test, the subject is told that he 
will be given some words and that he is to add some 
more words so as to produce the very best sentence 
he can make. A sample stem is given the subject 
and two ways of completing it are presented, one 
being designated as “better” than the other. For ex- 
ample, the stem might be “The house... .” One 
way to complete it, the subject is told, would be to 
say “is large.” Another, and a better way, would be 
to say “stood in the bend of the road.” Nothing is 
said about why the latter is better, and subjects as 
young as five years accept this as a self-evident 
superiority. 

With this brief introduction, the test is begun. It 
requires about 15 minutes to complete. It is rare, 
even at age five, to have a child say “I don’t know” 
to even a single item. Few persons seem to experi- 
ence any sense of failure in completing the sentences. 
The intrinsic interest is high and no matter at what 
level the individual is performing, he is likely to 
consider that he is doing well. This feature, of course, 
is a particular virtue when a whole battery of tests 
is to be given or when a subject is to be worked 
with therapeutically. 


Scoring the Responses 


Each response is scored 0, 1, 2, or 3 in 
accordance with prepared scoring standards. 
These standards reflect such considerations 
as relevance, complexity, descriptiveness, and 
range or scope. Numerous examples have 
been provided for each score value on each 
item. Judgment is still necessary, but in nu- 
merous studies the agreement in scoring has 
been high (r = .95 or above). For example, 
in one study using 24 protocols and with 10 
psychology graduate students as scorers, none 
of whom had had any previous experience 
with the test, the median interscorer reliabil- 
ity was .97. 

The split-half reliabilities of the test itself 
have been uniformly high. For the three stud- 
ies reported below, these reliabilities (cor- 
rected by the Spearman-Brown formula) 
were respectively .94, .95, and .93. No test- 
retest data have been gathered. 


Measuring “Effective Intelligence” 
(Validity) 


The most valuable criterion for a test of 
“effective intelligence” would be the judg- 
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A New Sentence-Completion Technique 


Table 1 
CSCT Scores for Three Age Groups 











Mean 
Age Test 
Group N Score Sigma CR 
er ont Oe er 2.27 (.02 level) 
. ; 1.66 (.05 level) 
11-6 20 53.85 11.73 





ments of objective, well-trained observers on 
the actual behavioral efficiency of large num- 
bers of individuals carrying out a variety of 
tasks. Such a criterion is not available, and 
the “construct validity” of a measure of ef- 
fective intelligence must be examined by di- 
verse approaches. 

Relationship of test scores to age. Two 
studies with data relevant to the relation- 
ship between test scores and CA can be cited. 
In the first, Form B of the CSCT was ad- 
ministered to all the children in two repre- 
sentative schools in Davidson County, Ten- 
nessee, a metropolitan community, who were 
within three weeks of being 9-6, 10-6, and 
11-6 years of age. Table 1 contains the data 
for the 66 children in this study. As may be 
seen, the two older groups of the study are 
rather small. If they are combined and com- 
pared with the younger group, the difference 
in mean scores is significant well beyond the 
.001 level. 

A second study of the relationship of CSCT 
scores to age was carried out with a group of 
50 orphanage children, again using Form B 
of the CSCT. These children ranged in CA 
months from 97 to 154, with a mean of 
129.66 and a sigma of 15.7. The obtained 
correlation between chronological age and 
CSCT scores is significant at the .0001 level, 
but this is not very informative. The size of 
an r between chronological age and any 
measure of mental age will be greatly in- 
fluenced by the range and distribution of 
chronological ages included. What is really 
needed, therefore, is an indication of the rs 
between CAs and mental ages on other well- 
known tests within this same group. This is 
available on this group of 50 children for the 
Goodenough Draw-A-Man Test, but for the 
Binet, only 33 of the 50 subjects were tested. 
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Table 2 
Correlations Between Age and Scores on 
Three Different Tests 
Sample A’ 
Sample A (A plus 17 
Test (N =33) additional! 
CSCT (Form B) 584 577 
Binet (Form L) 621 
294 400 


Goodenough DAM 





All of the tests—Form L of the Binet, the 
Goodenough, and the CSCT—were adminis- 
tered within a period of five or six weeks. 
The Goodenough drawings were collected and 
scored by an advanced graduate student, and 
the same student administered the CSCT, 
which was, however, scored by the author. 
In Table 2 the correlations of each of the 
tests with ages in months are shown. In com- 
puting these correlations, mental ages were 
used for the Binet and Goodenough data. 

Relationship of test scores to scholastic 
achievement. A widely used criterion of an 
intelligence test’s validity is the correlation 
of the test with school achievement. Form B 
of the CSCT and Forms L and M of the 
Binet were given, plus the Stanford Achieve- 
ment Test, to a group of 67 orphanage chil- 
dren aged 9-13, none of whom were in the 
study described above. As a group these chil- 
dren were retarded, as shown by a Binet IQ 
range (L and M were averaged) of 67.5 to 
98. Table 3 summarizes the pertinent data 
on this group. The range of intellectual abil- 
ity and of scholastic attainment within this 
group was rather narrow, considerably less 
than would be found within a single grade in 
the typical school system. In spite of this, 
both the Binet and the CSCT had substan- 


Table 3 


Range and Variability of an Orphanage Group on Two 
Tests of Intelligence and an Achievement Test 








Test 





Range Mean Sigma 

Binet (Land M) MA 140-76 107.36 13.87 

CSCT (Form B) 69-10 39.86 10.81 
Stanford Achievement 

(grade equivalent) 5.4-1.5 3.22 87 
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tial correlations with the Stanford scores. 
The Binet correlation was .623 and the CSCT 
correlation was .542. Since the Binet MAs 
and the CSCT were not highly correlated 
with each other (.349), a multiple r between 
these two and the Stanford Achievement was 
computed and was found to be .713. While 
the Binet was slightly more successful than 
the CSCT in predicting Stanford Achieve- 
ment scores, it should be noted that the 
Binet scores involved were averages of two 
forms, requiring a minimum testing time of 
two hours for each child, while the CSCT 
required about fifteen minutes for each child. 

Relationship of test scores to IQ gains in 
therapy. One other set of correlations may 
have particular interest for clinicians. As part 
of a larger experiment designed to increase 
IQs, 55 of these orphanage children with be- 
low-average IQs were exposed to an 18-month 
program of group play therapy, with 4 chil- 
dren in each group, and of enriched cultural 
opportunities. Both L and M of the Binet 
were given prior to and again at the conclu- 
sion of therapy. The range of IQ gains was 
not spectacular, from — 3.5 to + 17, with a 
mean of + 8.26 and an SD of 4.15. There 
was no relationship (.014) between the origi- 
nal IQ level and the amount of gain. How- 
ever, the original MA level correlated .172 
with the IQ gains and the CSCT scores cor- 
related .295 with these gains. These rs are 
significant at the .11 and the .02 levels, re- 
spectively. If other corroboration of this find- 
ing is forthcoming, it is potentially impor- 
tant, for any brief procedure which can help 
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to predict the gains of retarded children from 
therapy is worth knowing about. 


Summary 


A method of testing “effective intelligence” 
by an unstructured oral sentence-completion 
technique (Copple Sentence Completion Test) 
has been described. A median interscorer re- 
liability of .97 was found in an investigation 
involving 10 scorers, and in other, smaller 
projects, the results were comparable. Split- 
half reliabilities for three studies reported in 
the present paper were .94, .95, and .93. 

Three approaches to the test’s validity were 
reported: 


1. CSCT scores compared favorably with 
Binet and Goodenough mental ages in their 
correlation with chronological age. 

2. CSCT scores compared favorably with 
Binet mental ages in predicting Stanford 
Achievement Test scores. 

3. CSCT scores were significantly related 
to IQ gains subsequently made by a group 
of retarded children undergoing play therapy. 


Received November 1, 1955. 
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A Note on Porteus Maze and Wechsler-Bellevue 


Scores as Related to Antisocial Behavior’ 


Kenneth Purcell * 


Mental Hygiene Consultation Service, Fort Dix, New Jersey 


As part of a study on antisocial behavior 
and some of its personality correlates, data 
were collected on Porteus Maze and short- 
form Wechsler-Bellevue performance. Previ- 
ous studies (1, 2, 6, 9), using the qualitative 
method for scoring the Maze Test described 
by Porteus (5), have consistently reflected 
differences between delinquents and nonde- 
linquents. Nevertheless, it seemed worth while 
to include the Maze Test in the present in- 
vestigation, mainly because the type of popu- 
lation used differed in important respects from 
earlier populations studied. Each of the for- 
mer investigations had compared institution- 
alized delinquents with noninstitutionalized, 
presumably normal nondelinquent youths. The 
population employed in this study consisted 
of noninstitutionalized young male adults, all 
of whom were referred to a mental hygiene 
clinic for psychiatric problems. Porteus (7) 
has cited evidence to show that the Maze 
Test successfully predicts general social ad- 
justment and adaptation. The question posed 
in this paper concerns the capacity of the 
Maze Test, qualitatively scored, to distin- 
guish poor social adjustment characterized 
primarily by antisocial behavior from other 
types of poor social adjustment. 

Hunt and French (3), in one of their re- 
ports on the CVS abbreviated intelligence 
scale, found a tendency for psychopaths and 
disciplinary cases to score lower on Compre- 
hension than on Similarities and Vocabulary. 
However, the differences found were not sta- 


1 The author wishes to express his appreciation to 
Carl Ejisdorfer, Marvin Greenstein, Maurice Kou- 
guell, and Dr. Philip Kubzansky who collected data 
for this study and assisted in its statistical evaluation. 

2 Now at the University of Kentucky. 
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tistically significant. It is suggested by the 
authors that this may well have been be- 
cause of inadequate criteria for defining a 
psychopathic population. Since the groups 
used in this study were reliably separated 
along the dimension of antisocial behavior, 
a further examination of CVS results seemed 
appropriate. In addition, on the basis of our 
experience at this clinic, it was felt that the 
Block Design subtest generally offered the 
most valid estimate of intellectual capacity 
among antisocial subjects. This appeared to 
be because negativism and resistance were 
minimal and interest maximal for this sub- 
test which Wechsler (8) reports as having 
one of the highest correlations with Full Scale 
IQ of all subtests (.73). 


Method 


A total of 56 male Army trainees between 
the ages of 17 and 26 with a mean age of 20 
were used as Ss for this experiment. All were 
psychiatric referrals to the mental hygiene 
clinic for whom psychological testing was 
requested. The only patients excluded were 
those suffering from organic brain damage or 
those diagnosed as psychotic. 

It was the intent of this study to relate 
psychological test results to overt behavior. 
Therefore, each S was rated on manifest anti- 
social behavior trends on the basis of social 
history data routinely obtained by one of the 
psychiatric social work specialists. Both mili- 
tary and civilian data were used in making 
this evaluation. Originally two raters inde- 
pendently judged the degree of antisocial be- 
havior on a seven-point scale with a reli- 
ability of .91 being obtained. This scale was 
later consolidated into a three-point scale to 
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Table 1 


Analysis of Variance of Qualitative Maze Scores of Three Groups Differing in Antisocial Behavior 











Group Mean* Variancef F p 
Group I 30.38 Between groups 3,543.97 3.39 05 
Group II 42.94 Within groups 1,044.15 
Group III 58.57 





*Ns = 24, 18, and 14 for Groups I, II, and III, respectively. 


t Between groups df = 2; Within groups df = 54. 


secure an adequate number of Ss for each 
category. Group I, the non-antisocial group, 
was composed of Ss whose overt aggressive 
behavior was minimal with no record of 
military or civilian disciplinary infractions. 
Group II, the intermediate group, consisted 
of Ss who may have been in fights with mod- 
erate frequency and whose record was marked 
by some school truancy, perhaps minor in- 
fractions of the law, and AWOL from the 
military. Group III, the antisocial group, 
consisted of Ss with an extensive pattern of 
delinquency including some criminal actions 
such as assault, rape, burglary, dope ped- 
dling, etc. 

The three groups were compared on quali- 
tative Maze Test scores and on the four 
Wechsler-Bellevue subtests noted above. In 
each case these measures were administered 
as part of a battery of clinical tests. 


Results and Discussion 


Analysis of variance of the qualitative maze 
scores, as shown in Table 1, discloses a dif- 
ference between the mean Q scores (total 
weighted qualitative score) of the three 
groups significant at the .05 level. This is 
not so significant a result as has been re- 
ported in other studies of the Maze Test. 
For example, Docter and Winder (1) report 
a difference between delinquents and nonde- 
linquents significant at the .0001 level. There 
are at least two reasons for the apparent in- 
sensitivity of the maze in differentiating be- 
tween these two groups in the present study. 
First, the range of Ss was marked by more 
continuity with respect to antisocial behavior 
than was true in other investigations. The 
presence of a group intermediate between the 
extreme antisocial and non-antisocial groups 
created additional overlap of scores. Second, 


the number of Ss used in this study, particu- 
larly if one considers only the two extreme 
groups, was considerably smaller than the Ns 
reported by other authors (1, 2, 5, 6, 9). If 
similar differences were obtained with larger 
Ns, the significance level would, of course, be 
higher. 

It will be remembered that Group I Ss 
were also psychiatric referrals, the great ma- 
jority of whom had manifested some form of 
social maladjustment in the Army situation. 
They may have been seclusive, overdepend- 
ent, easily frightened by authority or socially 
deficient in almost any respect except that 
none of these Ss were aggressively antisocial 
in their overt behavior. Since the Q scores of 
these Ss were only slightly higher, and on a 
relative basis not at all higher than the Q 
scores of normal controls used in previous 
studies, it is apparent that the qualitative 
Maze Test tends to be sensitive only to cer- 
tain types of social maladjustment. It seems 
to be maladjustment marked by nonconform- 
ing impulsivity that is singled out by this 
qualitative analysis. Impulsivity is probably 
the essential ingredient for high Q score. For 
example, one of the severely antisocial Ss was 
a shrewd, calculating relatively controlled per- 
son whose record reflected many of these 
characteristics. He had been arrested on nu- 
merous occasions but always managed to 
elude conviction. His Q score of 5 was one of 
the lowest for all Ss. In his behavioral notes 
the tester noted that S was calm, detached, 
and obviously out to demonstrate that he (S) 
could easily master the situation. 

Since several authors (1, 7) have noted a 
small but significant correlation between Q 
score and intelligence, the intellectual factor ° 
must be measured and accounted for in some 
way. The task of estimating the intellectual 
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capacity of antisocial Ss is difficult since their 
test performance is so colored by uncoopera- 
tive, negative attitudes. This was a particu- 
larly important problem in this study, as 
many of the Ss were actively engaged in in- 
tense rebellion against any form of military 
authority. 

Table 2 presents the results of the analysis 
of variance of the Wechsler-Bellevue data. 
The differences between the means of the 
three groups are significant for Prorated IQ, 
Similarities, Vocabulary, and Comprehension 
scores with antisocial Ss achieving the lowest 
scores in each instance. By contrast, the dif- 
ferences found in Block Design means do not 
even approach significance. The reasons for 
this finding are at least twofold. First, almost 
every antisocial S$ had been an habitual tru- 
ant and had left school much earlier than the 
non-antisocial Ss. The benefits of educational 
experience on such tests as Vocabulary or 
Comprehension were lost to these Ss. Second, 
the verbal interaction required by the verbal 
subtests reproduces former unpleasant school 
situations and creates many more opportuni- 
ties for the arousal of authority conflicts than 
occurs during administration of the Block 
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Design test. During the latter, S may focus 
his attention on impersonal objects without 
the necessity of replying directly to the ex- 
aminer. Antisocial individuals seem to find it 
less necessary to assert their rebellious non- 
conformity in this more oblique relationship 
with the examiner. 

Turning briefly to a comparison of Com- 
prehension scores with those of Similarities 
and Vocabulary, we find, from an inspection 
of means and significance levels indicated in 
Table 2, the same tendency for antisocial Ss 
to score relatively lower on Comprehension, 
as was noted by Hunt and French (3). Al- 
though the differences between these subtests 
do not attain statistical significance, the 
qualitative variation between the groups in 
their responses to certain items of the Com- 
prehension test seems worthy of note. The 
answers of antisocial Ss to the questions on 
the envelope, bad company, taxes, and law, 
commonly contained antisocial content in 
some part of the response, although giving 
credit to the response might be required un- 
der the rules of scoring. Perhaps a revised 
scoring scheme or the remodeling of Com- 
prehension to encourage antisocial answers in 


Table 2 





Analysis of Variance of Wechsler-Bellevue Subtest Scores of Three Groups Differing in Antisocial Behavior 








Variable Mean* Variancef F p 
Prorated IQ 
Group I 103.45 Between groups 3,296.47 9.36 001 
Group II 96.26 Within groups 352.26 
Group ITI 76.35 
Block Design 
Group I 9.62 Between groups 8.14 65 Non- 
Group IT 8.94 Within groups 12.40 signif. 
Group III 8.28 
Comprehension 
Group I 10.22 Between groups 113.06 10.77 001 
Group II 8.39 Within groups 10.49 
Group III 5.00 
Vocabulary 
Group I 9.34 Between groups 50.32 5.22 01 
Group II 8.93 Within groups 9.64 
Group III 6.56 
Similarities 
Group I 11.00 Between groups 101.72 7.47 01 
Group II 9.81 Within groups 13.61 
Group ITI 6.83 





* Ns = 24, 18, and 14 for Groups I, II, and III, respectively. 


Tt Between groups df = 2; Within groups df = 54. 
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accordance with the suggestion of Hunt and 
French (3) would improve its discrimination 
along the dimension of antisocial trends. 


Summary 


One of the questions with which this study 
was concerned was whether the Porteus Maze 
test, qualitatively scored, could successfully 
distinguish between poor social adjustment 
marked by antisocial behavior and various 
other types of poor social adjustment. The 
second aspect of the study dealt with scatter 
scores among the Comprehension, Vocabulary, 
Similarities, and Block Design subtests of the 
Wechsler-Bellevue Intelligence Scale. 

Fifty-six young adult Army trainees, all of 
them referrals to the Mental Hygiene Con- 
sultation Service, serve as Ss. The Ss were 
divided into three groups on the basis of wide 
variations in overt antisocial behavior. 

Qualitative Maze Test scores differentiated 
significantly between three groups, the most 
antisocial group having the largest Q score. 
It appeared that impulsivity was the basic 
personality variable measured by qualitative 
evaluation. 

Finally, our impression that Block Design 
performance would be the subtest least im- 
paired by antisocial tendencies was confirmed. 
The results showed no significant differences 
between the three groups in Block Design 
scores, although the Comprehension, Vocabu- 
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lary, and Similarities scores differed signifi- 
cantly. An explanation for these findings is 
offered in terms of the more remote examiner- 
subject relationship during Block Design ad- 
ministration as compared with the other three 
subtests. 


Received December 20, 1955. 
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and Nonstuttering Children’ 


Leonard D. Goodstein 


State University of Iowa 
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While there has been much published specu- 
lation and theorizing about the etiology of 
stuttering (e.g., 1, 8), until recently there 
have been relatively few empirical studies of 
the onset of this fairly common clinical prob- 
lem. Darley (2), Johnson (9), LaFallotte 
(12), and Moncur (13, 14) have all at- 
tempted to investigate the circumstances sur- 
rounding the development of stuttering by 
intensively interviewing the parents of stut- 
tering children. The present study represents 
a cross validation and extension of these prior 
investigations and includes an objective meas- 
ure of parental adjustment, the Minnesota 
Multiphasic Personality Inventory (MMPI). 
This paper reports the MMPI results, while 
the more general findings are reported else- 
where (10). 

The MMPI has been used in a similar way 
for evaluating parental adjustment in its re- 
lation to child behavior by Williams (18) in 
a study of families in which there was a 
cerebral-palsied child, and by Grossman (7) 
in relation to stuttering. The present study 
is an extension of this method to a larger, 
more carefully selected set of parents. 


Subjects and Procedures 


A family was considered for inclusion in 
the stuttering or experimental group if one 


1 This research was supported by a grant from the 
Louis W. and Maud Hill Family Foundation of St. 
Paul, Minnesota. The authors wish to thank Dr. 
Wendell Johnson, the principal recipient of the 
grant, for his help and encouragement during the 
execution of the research and the preparation of this 
manuscript. 


or both of the parents regarded the child as 
a stutterer and if the duration of the stutter- 
ing was reported as less than 36 months. A 
family was considered in the nonstuttering or 
control group if the child had never been re- 
garded as a stutterer by either of the parents. 
Each experimental family was matched to a 
control family on the basis of the child’s age 
and sex as well as the socioeconomic status 
of the parents. Only essentially physically 
normal, white children were included in the 
study. The experimental subjects (Ss) had 
been referred to the experimenters by speech 
therapists, physicians, etc., while the control 
Ss were secured by appeals for volunteers to 
PTA’s, church groups, etc. The complete de- 
tails of subject selection are available else- 
where (10). 

The routine administration of the booklet 
form of the MMPI was part of the standard 
interview procedure (see 10) that was fol- 
lowed with each parent who had agreed to 
participate in the over-all study. Usually one 
parent took the test while the other parent 
was being interviewed. If the tests were not 
completed before the interviewer was ready 
to leave, arrangements were made to have the 
booklets and answer sheets picked up at some 
later time. 

While it had been planned to secure an 
MMPI profile from each parent, this goal 
was not realized due to a number of uncon- 
trollable factors such as lack of time, refusal 
to cooperate, etc. MMPI profiles, however, 
were obtained from 224 of the 300 experi- 
mental parents and from 223 of the 300 con- 
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trol parents. An inspection of the other data 
did not suggest that those parents who had 
failed to complete the MMPI were different 
in any obvious ways from those parents for 
whom profiles were secured. 

All MMPI answer sheets were hand scored 
by the interviewers for the usual four va- 
lidity and nine clinical scales as well as four 
special scales that were thought, on an a 
priori basis, to represent variables important 
to the onset of stuttering. These four special 
scales were the Taylor Manifest Anxiety Scale 
(A) (15), the Achievement Scale (Ac) (5), 
the Dominance Scale (Do) (6), and the 
Status Scale (St) (4). All profiles were also 
coded into Welsh’s high-point coding system 
(16). For ease in tabulation and statistical 
analysis, the scores on the 17 scales as well 
as the high-point codes were then coded into 
a predetermined coding system and punched 
on standard IBM punch cards. 


Results 


Four groups of 100 profiles each, experi- 
mental mothers, experimental fathers, con- 
trol mothers, and control fathers were se- 
lected from the 447 available MMPI profiles 
by excluding the 47 profiles whose validity 
scale scores suggested a significant degree of 
dissimulation. The cutting scores for exclu- 
sion were: L-scale raw score of 7 or higher, 
or F-scale raw score of 17 or higher, or K- 
scale raw score of 28 or greater. The num- 
bers of cases excluded on the basis of these 
criteria were 17, 22, and 8, respectively; 
there were no obvious differences between the 
control and experimental parents with respect 
to these validity criteria. 

The means and standard deviations on the 
17 MMPI scales for the experimental and 
control parents, together with the significance 
levels of the differences between the experi- 
mental and control means are presented in 
Table 1. For ease in communication the 
scores in Table 1 are reported in values based 
on the original distributions of scores rather 
than the coded score distributions. The re- 
ported values are, consequently, approximate 
rather than exact; all statistical tests, how- 
ever, were based upon the more exact coded 
score values. 

A prior comparison of the means of the 
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Table 1 


Means and SDs* of the MMPI Scales for the Two 
Groups and the Significance Levels of the 
Differences Between the Means 
(N = 200 in each cell) 














Experimental Control 
parents parents 
Scalet Mean SD Mean SD pt 
? 6.5 7.9 5.8 4.6 ns 
L 3.3 1.9 3.6 1.8 ns 
F 3.2 2.5 2.8 1.8 ns 
K 15.5 $2 16.9 48 05 
Hs 51.6 8.7 53.0 7.4 ns 
D 53.9 10.3 52.4 9.2 ns 
Hy 55.5 7.9 56.1 7.7 ns 
Pd 53.2 9.6 54.8 9.0 ns 
Mf§ 52.5 10.1 52.7 10.1 ns 
Pa 52.8 78 52.3 6.8 ns 
Pt 53.0 8.6 51.9 7.7 ns 
Se 51.7 9.0 52.2 7.8 ns 
Ma 51.9 10.1 51.2 9.1 ns 
A 14.3 7.6 11.8 6.9 O01 
Ac 16.9 2.9 16.8 2.6 ns 
Do 17.3 3.3 18.8 3.3 01 
St 21.4 3.9 22.5 4.0 ns 





* These values are approximate as they were derived from 
the coded scores. 

+ The scores on the clinical scales (Hs, D, Hy, Pd, Mf, Pa, 
Pt, Sc, and Ma) are in T-score units; all the others are expressed 
in raw score units. K-scale corrections included where appro- 
priate. 

t The ¢ tests for mean differences as well as the significance 
levels for these differences were based upon the more precise 
coded score values rather than the values reported in this table. 

§ Both groups of fathers were significantly higher (p < .01) 
than both groups of mothers; there were, however, no signifi- 
cant differences between the stutterers and nonstutterers. 


experimental mothers with the experimental 
fathers and of the control mothers with the 
control fathers gave no evidence of any sex 
differences in the scores and the data from 
the two sexes were combined. The only ex- 
ception to this finding was for the Mf scale 
on which both groups of fathers had signifi- 
cantly higher (p < .01) mean T scores than 
both groups of mothers; there were, however, 
no significant differences between the experi- 
mental and control groups and the data were 
therefore combined in this case as well. 
There were only three significant mean dif- 
ferences between the experimental and the 
control parents on the 17 MMPI scales. The 
control parents had significantly higher mean 
scores on the K and Dominance (Do) scales 
(p < .05, p<. .01, respectively) while the ex- 
perimental parents had a significantly higher 
mean score on the Anxiety (A) scale (p < 
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Table 2 
Relative Frequencies of the Initial Scale in the High-Point Code for Each Group 











MMPI Clinical Scales* 





Ma 





Group Hs D oa, SS oe Ps Pt Se Total 
Experimental mothers 4 13 18 12 15 12 & 6 12 100 
Experimental fathers 5 17 16 8 23 4 5 4 18 100 

Subtotal 9 30 34 20 38 16 13 10 30 200 
Control mothers 3 18 27 22 6 7 7 5 5 100 
Control fathers 1 10 17 18 27 4 5 3 15 100 

Subtotal 4 28 44 40 33 11 12 8 20 200 
Grand total 13 58 78 60 71 27 25 18 50 400 





* Only the nine clinical scales were used in this analysis. 


01). None of the other mean differences were 
significant at an acceptable level of confidence. 

The relative frequencies in the four groups 
of Ss of the initial scale in the high-point 
code, that is, the clinical scale with the high- 
est 7-score value in the individual profiles, 
are presented in Table 2. The differences be- 
tween the obtained frequencies were then 
analyzed by means of the chi-square test for 
a two-by-nine table. There were no significant 
differences between the experimental and con- 
trol fathers, between the experimental and 
control mothers, or between the experimental 
and control parents combined over sex. There 
were, however, significant differences between 
the total male group and the total female 
group when the experimental and control 
groups are combined (p < .001). It may be 
seen in Table 2 that the Mf and Ma scales 


occurred relatively more frequently in the 
initial position in the male codes while the 
Pa and Sc scales were the more frequent ini- 
tial points in the female codes. 

The relative frequencies in the four groups 
of Ss of the secondary scale in the high-point 
code, that is, the clinical scale with the second 
highest T-score value in the individual pro- 
files, are presented in Table 3. The differences 
between the obtained frequencies were again 
analyzed by the means of the chi-square test 
for a two-by-nine table. Again there were 
no differences between the experimental and 
control fathers, the experimental and control 
mothers, or between the total experimental 
and the total control group. Again there were 
significant differences between the total male 
group and the total female group when the 
experimental and control groups are com- 


Table 3 
Relative Frequencies of the Secondary Scale in the High-Point Code for Each Group 








MMPI Clinical Scales* 








Group Hs D Hy Pd Mf Pa Pt Se Ma Total 
Experimental mothers 9 11 21 12 6 14 12 9 6 100 
Experimental fathers 4 18 13 14 15 11 8 5 12 100 

Subtotal 13 29 34 26 21 25 20 14 18 200 
Control mothers 9 6 20 14 13 12 8 7 11 100 
Control fathers 2 13 18 16 16 9 6 9 11 100 

Subtotal 11 19 38 30 29 21 14 16 22 200 
Grand total 24 48 72 56 50 46 34 30 40 400 





* Only the nine clinical scales were used in this analysis. 
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bined (p < .02). It may be seen in Table 3 
that the D, Mj, and Ma scales occurred rela- 
tively more frequently in the secondary po- 
sition in the male codes while the Hs and Hy 
scales were the more frequent secondary 
scales in the female codes. 

The number of parents in the two groups 
with one or more T scores above 70 on the 
clinical scales (the usual cutting score) was 
then obtained in an effort to evaluate the rela- 
tive frequency of “severe disturbance” in the 
two groups. There were 47 persons with one 
or more 7 scores above 70 in the control 
group and 49 persons in the experimental 
group; by eliminating from consideration 
those cases where both parents in the family 
had scores above 70, these numbers were re- 
duced to 41 and 42, respectively. An inspec- 
tion of the high-point codes of those “severely 
disturbed” individuals did not reveal any dif- 
ferences between the experimental and con- 
trol groups although the sex differences that 
were discussed above (see Tables 2 and 3) 
were again present. 

In an attempt to evaluate the differences 
in the severity of stuttering as a function of 
the “abnormality” of the MMPI scores, the 
experimental group was then divided into two 
subgroups, those families in which one or 
both parents had one or more clinical scale 
T scores above 70 (the disturbed group) and 
those families in which neither parent had a 
clinical scale T score above 70 (the nondis- 
turbed group). Independent ratings by the 
interviewers of the experimental children on 
the Iowa Scale for Rating the Severity of 
Stuttering (11, pp. 129-136) were available 
for 38 children whose parents were placed in 
the disturbed group and 77 children whose 
parents were in the nondisturbed group. The 
mean rating for the children of the disturbed 
parents was 1.97 (SD = .97) while the mean 
rating for the children of the nondisturbed 
parents was 3.13 (SD = .71); the differences 
between these means is highly significant (¢ 
= 6.4, p < .001) and indicates that the chil- 
dren of the nondisturbed parents were the 
more severe stutterers. (A rating of 1 on the 
Iowa Scale indicates normal, fluent speech 
while a rating of 3 indicates a relatively mild 
degree of stuttering.) 
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Discussion 

The most striking finding of this portion of 
the total study was the failure to establish 
any major differences between the parents of 
the stuttering and nonstuttering children. Al- 
though certain differences between the two 
groups of parents were statistically signifi- 
cant and psychologically provocative, the 
over-all impression left by the results is that 
whatever important differences exist between 
these groups is not reflected in their MMPI 
scores. Certainly the small absolute differ- 
ences between the means and the large 
amount of overlap between the two distribu- 
tions of scores would prohibit the use of the 
MMPI as an instrument for differentiating 
these groups in clinical work. 

These results are in essential agreement with 
those of Grossman (7) who reported that the 
parents of stutterers gave more atypical or 
pathological responses on the MMPI than 
the parents of nonstutterers on the F scale 
but did not differ on any of the other scales. 
Her results were obtained from older, more 
chronic stutterers and her parent groups in- 
volved only an N of 42 each. It should be 
noted that the differences between her two 
groups were small in absolute magnitude and 
statistically unreliable. 

It is essential to note that not only are both 
sets of parents very similar to each other in 
their mean MMPI profiles but they are also 
very similar to the original standardization 
group of Minnesota normals (17). The high- 
est clinical scale mean for either group is 56.1 
for the control parents on the Hy scale; this 
represents a deviation of only 0.6 SDs from 
the expected score of a group of psychiatri- 
cally normal persons, that is, 50.0. The high- 
est clinical scale mean for the experimental 
group, who presumably should be more mal- 
adjusted, is 55.5, also on the Hy scale. These, 
of course, are the largest differences from the 
expected normal mean score and all the other 
obtained means are closer to the expected 
value of 50.0. While some of these differences 
may be of statistical significance, they are 
not, in all probability, psychologically impor- 
tant. It is more reasonable to ascribe such 
small differences to sampling factors than to 
any “real” differences between the individuals 
studied in the present investigation and the 
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standardization group. These differences cer- 
tainly are not evidence for regarding either 
the control or experimental groups of parents 
as psychiatrically abnormal or more malad- 
justed than adult persons in general. 

The absence of abnormality in these mean 
profiles is even more strikingly brought home 
when these profiles are compared with the 
mean profiles of various neurotic and psy- 
chotic groups (17) and of college students 
(3, 17). Each of these latter groups have 
characteristic mean profiles with very marked 
peaks and valleys differing greatly from the 
flat mean profiles obtained from these par- 
ents. Such comparisons strongly support the 
contention that these individuals are, for the 
most part, essentially normal persons who 
differ little, if at all, from the group of nor- 
mal adult Minnesotans who comprised the 
standardization sample. These results inci- 
dentally, especially those with the control 
group, offer strong support for the continued 
use of the MMPI as a screening instrument 
with presumably normal adults. 

While the present results offer no evidence 
for concluding that these two groups of par- 
ents are very different from each other or 
from normal individuals generally, it should 
be recalled that 3 of the 17 mean differences, 
although the absolute differences were quite 
small, were significant and deserve further 
comment. On the K scale the control parents’ 
mean was significantly higher than that of 
the experimental parents. This would indicate 
that the control parents were somewhat more 
defensive, less likely to admit psychological 
defects or problems, than the experimental 
parents. These control parents had not been 
asked, as had the experimental parents, to 
admit as part of the selection procedure that 
they regarded their children as deviant, and 
certainly might be expected to be less open 
and more defensive in such a clinical situation. 

The finding that the control group mean on 
the Dominance (Do) scale was significantly 
higher than that of the experimental group, 
suggesting greater dominance in face-to-face 
personal relationships, is rather difficult to in- 
terpret and fit into the other findings and 
conclusions about the onset of stuttering. 
Little is known about the interpretation and 
usefulness of the Do scale as it is a recent 
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addition to the MMPI scales and perhaps any 
more elaborate interpretation of this result 
should be postponed until additional informa- 
tion about the Do scale is available. 

The experimental parents were significantly 
higher on the Anxiety (A) scale than were 
the control parents. Certainly the admission 
of greater tension and anxiety on the part of 
the stutterers’ parents may be regarded as an 
important finding, but great care should be 
exercised in relating this finding to the onset 
of stuttering. The heightened anxiety on the 
part of the experimental parents may be an 
important etiological factor in the develop- 
ment of stuttering, or may be a subsequent 
reaction by these experimental parents to the 
development of stuttering by their children. 
The present results do not permit a resolu- 
tion of these alternative interpretations. 

Perhaps the best evidence that the experi- 
mental group of parents was essentially nor- 
mal from a psychiatric point of view was the 
finding that the “deviant” group of experi- 
mental parents, those with one or more T 
scores above 70, had children with little or 
no stuttering as compared with the nondevi- 
ant experimental group. On the other hand, 
the MMPI profiles of the parents of the more 
severe stutterers were almost uniformly within 
normal limits and gave no indication of severe 
psychological pathology of any sort. It may 
be that these deviant parents volunteered for 
participation not so much because they were 
concerned with nonfluencies in their chil- 
drens’ speech, but rather with their own prob- 
lems; they may have regarded participation 
in the experiment. as one way of receiving 
help with these more general problems, rather 
than an opportunity to discuss their chil- 
drens’ speech. Such an interpretation would 
indicate a need for caution on the part of the 
speech therapist when he does find a stut- 
terer’s parent with a deviant MMPI profile. 

The MMPI results are in general agree- 
ment with those of the over-all investigation 
(10) in that they do not support the notion 
that the parents of stutterers are very differ- 
ent in their behavior and attitudes from the 
parents of nonstutterers. Rather these results 
support the suggestion that those group dif- 
ferences that are important in the etiology of 
stuttering are rather limited and probably 
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quite specific to the speech situation and the 
stuttering phenomenon. 


Summary 


The Minnesota Multiphasic Personality In- 
ventory (MMPI) was administered to 447 
parents participating in an interview study 
of the onset of stuttering in an attempt to 
compare the relative psychological adjust- 
ment and personality characteristics of par- 
ents of stuttering children (experimental 
group) and parents of nonstuttering children 
(control group). Four groups of 100 MMPI 
records, experimental fathers, experimental 
mothers, control fathers, control mothers, 
were selected by excluding those records 
whose validity scores suggested dissimulation. 

There were only three significant mean dif- 
ferences between the experimental and con- 
trol parents on the 17 MMPI scales used in 
this study; the control parents had signifi- 
cantly higher mean scores on the K and Do 
scales while the experimental parents had a 
significantly higher mean score on the A 
scale. A comparison of the high-point codes 
of the experimental and control groups re- 
vealed no significant differences between these 
two groups. There were no significant differ- 
ences in the number of parents in the two 
groups with one or more clinical scale T 
scores above 70, the usual cutting score in- 
dicating some degree of psychological pa- 
thology. The children of the experimental 
parents with T scores above 70 were shown 
to have been rated as having significantly 
less severe stuttering than the remainder of 
the experimental group children. 

These results were interpreted as indicating 
that the performances of these two groups 
were not significantly different, nor was the 
performance of either group significantly dif- 
ferent from the expected performance of psy- 
chiatrically normal persons. There was no 
evidence that the etiology of stuttering is 
rooted in gross psychological pathology of the 
stutterer’s parents but rather the results are 
to be interpreted as supporting the notion 
that such attitudes and adjustments of the 
parents as may be involved in the etiology of 
stuttering are probably limited and specific 
to the speech situation. 


Received January 3, 1956. 
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Response Bias and the MMPI 


Eugene H. Barnes 
VA Hospital, Downey, Illinois 


In a recent article Berg (2) has proposed 
what he terms “the deviation hypothesis.” 
This hypothesis is stated as follows: “Deviant 
response patterns tend to be general; hence 
those deviant behavior patterns which are 
significant for abnormality and thus regarded 
as symptoms, are associated with other devi- 
ant response patterns which are in noncriti- 
cal areas of behavior and which are not 
regarded as symptoms of personality aberra- 
tion” (2, p. 62). Deviant responses are de- 
fined as those which differ from modal re- 
sponses or criterion group responses at a 
given level of significance. Berg states, “To 
test the hypothesis stated above, it is neces- 
sary only to identify the common or modal 
responses to any suitable series of stimulus 
patterns and predict that those subjects whose 
responses rather consistently go against the 
modal preferences will be deviant in the sense 
of exhibiting symptom patterns associated 
with abnormal states. It should be noted that 
content of stimulus patterns is not regarded 
as important” (2, pp. 62-63). 

In a study pertinent to this hypothesis 
Barnes (1) used the Perceptual Reaction Test 
(PRT) as an instrument for eliciting deviant 
response patterns. The PRT is a series of 
sixty abstract designs made with ruler and 
compass to which the S$ is required to re- 
spond by marking one of the following op- 
tions: Like much, like slightly, dislike slightly, 
or dislike much. By the use of deviant re- 
sponse patterns alone it was possible to con- 
struct scales for certain categories of psy- 
chopathology. In this study it was found 
that psychotic groups differed in their option 
preferences from normal groups in that the 
former generally preferred like much while 
the normal groups tended to prefer dislike 
slightly. A set to choose dislike much tended 
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to be characteristic of character disorders in 
contrast to the performance of psychotic and 
normal groups. Cottle and Powell (4) dem- 
onstrated that answering all of the MMPI 
items “true” produced a profile with high 
scores on the psychotic triad while scoring all 
items “false” produced high scores on the 
neurotic triad. Berg cites these studies as evi- 
dence suggesting that, ‘““a marked tendency to 
say ‘yes’ or ‘agree,’ etc., in personality inven- 
tories should be associated with psychotic 
states while ‘no’ or ‘disagree’ option choices 
should be associated with neurotic states’ 
(2, p. 66). 

There are evidently two parts to the devia- 
tion hypothesis as stated by Berg. The first 
and major part concerns merely the associa- 
tion of deviant response patterns on psycho- 
logical tests with deviant behavior patterns. 
It asserts a general relationship between devi- 
ant test behavior and the deviant behavior of 
psychopathological conditions. This hypothe- 
sis draws together a variety of behavioral 
phenomena and makes explicit an assumption 
which has been implicit in much prior work. 
Few would argue that a “blood response” on 
the Rorschach was deviant behavior in a so- 
cially critical area, while few would disagree 
that strangling one’s mother is. Nevertheless, 
a tenuous relationship has been assumed be- 
tween “acting out” behavior and the socially 
nonsignificant response of “blood.” The sec- 
ond and minor portion of the hypothesis con- 
cerns the directionality of such deviant re- 
sponse patterns as have been observed. This 
portion of the hypothesis asserts, in effect, 
that different types of response sets or biases 
are associated with different types of deviant 
behavior. 

Individual record forms of the MMPI are 
a convenient source of material for testing 
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certain deductions from the deviation hy- 
pothesis since the scoring system provides 
that only those items which are answered in 
the infrequent direction are recorded (6, p. 
8). These items are, by Berg’s definition, 
deviant responses. Thus, if content of stimu- 
lus patterns is not important, it would be in- 
ferred that the number of deviant responses, 
i.e., Xs, on the record form of the MMPI 
should be as effective a measure of psycho- 
pathology in general as the profile itself. A 
scale of the MMPI which specifically pur- 
ports to measure a psychotic reaction is the 
Sc scale. This scale, then, should correlate 
significantly with the total number of deviant 
responses. A preliminary check with grouped 
data revealed that the number of Xs corre- 
lated .92 with the Sc scale for a sample of 24 
male veterans in a neuropsychiatric hospital. 
The correlation to be expected from the num- 
ber of common elements is .38 as calculated 
from a formula given by McNemar (8, p. 
118). This limited confirmation suggested a 
more systematic exploration. 

A large number of studies have indicated 
that the MMPI and the groups of scales 
known as the neurotic triad (Hs, D, Hy) 
and psychotic triad (Pa, Pt, Sc) have useful 
clinical validity (3). By inference from the 
deviation hypothesis, the total number of 
deviant responses should generally correlate 
with scales indicative of “symptom patterns 
associated with abnormal states” (2, p. 63). 
From the second part of the hypothesis it 
follows that deviant responses on the MMPI 
resulting from responding “true” when the 
modal response is “false” should correlate 
most highly with the scales of the psychotic 
triad while responding “false” when the modal 
response is “true” should correlate more 
highly with the scales of the neurotic triad. 
Since these tendencies to prefer certain option 
choices are conceived as being different re- 
sponse sets, the tendency to answer atypically 
true should not be correlated with the tend- 
ency to answer atypically false. 


Procedure 


Subjects. The Ss were 40 males who, for 
one reason or another, had been given the 
MMPI by the author. These Ss were a 
very heterogeneous group including psychotic 


outpatients, neurotics, vocational counseling 
cases, and court referrals. Variances on the 
MMPI clinical scales are, therefore, doubt- 
less larger than from a random sample of the 
general population. It should also be noted 
that they were not representative of any 
homogeneous clinical grouping. The group 
was probably representative of those who 
come to the attention of a psychologist in 
private practice. 

Method. Pearson rs were machine calcu- 
lated for correlations of each of the clinical 
scales and three validating scales with, first, 
total number of deviant responses, second, 
number of deviant responses resulting from 
answering items false, and third, number of 
deviant responses resulting from answering 
items true. 


Results 


Results are summarized in Table 1. It will 
be noted that the differences in relative sizes 
of correlation coefficients on the neurotic triad 
and psychotic triad are all in the predicted 
direction, a result significant, without respect 
to magnitude, at less than a 2 per cent level 
of confidence as calculated by the sign test. 
The correlation between the deviant responses 
obtained by answering false and those ob- 
tained by answering true is less than the 
standard error and we may accept the hy- 


Table 1 
Correlations of MMPI Scales with Deviant Responses 











Total Deviant Deviant 
number of responses responses 
deviant answered answered 
Scale responses true false 
L —.09 —.24 37 
F 74 .66 43 
K —.60 — .69 .06 
Hs .66 52 56 
D 56 34 73 
Hy 12 — .09 59 
Pd 71 68 .28 
Mf 32 .28 21 
Pa .60 52 37 
Pt 87 84 31 
Se .93 90 36 
Ma 38 50 21 
Dev. responses 
answered false Al 
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pothesis of no correlation. Certainly the total 
number of deviant responses is significantly 
associated with the clinical scales. 


Discussion 

This study was designed to test certain de- 
ductions derived from the deviation hypothe- 
sis and collateral research, not as a definitive 
test of the total hypothesis. Within this re- 
stricted frame of reference the deductions 
have been confirmed and this confirmation in- 
directly lends support to the hypothesis from 
which they were derived. 

The deviation hypothesis does not take into 
consideration the matter of item content. If a 
preference for a response option exists in a 
normal population, deviating from this pref- 
erence should add some increment to an in- 
dicator of psychopathology. In the present 
study we find evidence of such a phenome- 
non. With no attention given to particular 
items or their content, a mere summing of 
the deviant responses bears a striking rela- 
tionship to certain of the MMPI clinical 
scales, as shown in Table 1. 

One interesting implication of this portion 
of the hypothesis is that it would provide a 
theoretical rationale for the behavior of the 
F scale of the MMPI. This scale was con- 
structed from items which were very rarely 
answered in the scored direction by normal 
Ss (5). On inspection these items seem to 
have no psychological coherence or commu- 
nality, yet it has been observed that the F- 
scale score is elevated in personality aberra- 
tions (7, 9). The F scale has even been 
dubbed somewhat informally and inelegantly 
“the screwball scale.”* The deviation hy- 
pothesis would predict just such a result from 
a scale so constructed. It would, in fact, pre- 
dict that even a random selection of items 
scored in this manner would give a similar 
result. 

With respect to the direction of deviation, 
it is clear that deviating by atypically answer- 
ing true is associated with MMPI scales in- 
dicative of a psychotic type of pathology 
while atypically answering false is associ- 
ated with scales indicative of a neurotic re- 


1 If memory serves correctly the author first heard 
this label used by Dr. E. S. Bordin. 


action. It may be argued that this result is 
obtained simply because the scales of the 
neurotic triad are more heavily weighted with 
items scored when they are answered false 
and those of the psychotic scales more 
heavily weighted with items scored when an- 
swered true. Although it is a fact that a de- 
gree of such differential weighting exists, such 
an “explanation” is circular in that it fails to 
explain the origin of such a weighting. Why 
did it happen, in the empirical derivation of 
the scales, that more items were found to 
discriminate between certain clinical groups 
and normals when answered in just these di- 
rections? In view of the results obtained in 
this study and the other studies cited, an an- 
swer in terms of chance distribution of item 
content seems untenable. It seems increas- 
ingly probable that response sets play a 
much more important role in personality test- 
ing than has been suspected. 


Summary 


A study was undertaken to test some de- 
ductions concerning the MMPI derived from 
Berg’s deviation hypothesis (2). According to 
this hypothesis Ss whose responses consist- 
ently go against modal preferences should 
display symptom patterns of abnormality 
Further, this hypothesis would predict that 
atypically responding true should be asso- 
ciated with psychotic states while atypically 
answering false should be associated with neu- 
rotic conditions. These two tendencies should 
be relatively unrelated to one another. 

Since the record form of the individual 
MMPI provides that only items answered 
in the infrequent direction are entered, it is 
possible to count the number of deviant re- 
sponses, to separate these into those ob- 
tained by answering atypically false and 
atypically true, and to correlate these scores 
with the scales of the MMPI. These opera- 
tions were performed and it was found, as 
predicted, that the total number of deviant 
responses correlated highly with the clinical 
scales, that the tendency to answer atypi- 
cally true was more highly correlated with the 
scales of the psychotic triad while the tend- 
ency to answer atypically false was more 
highly correlated with the neurotic triad, and 
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that the two tendencies were not significantly 
correlated. 

It is concluded that the deviation hypothe- 
sis has received some indirect confirmation 
both with respect to its general statement 
and with respect to direction of deviation. 
Also, it is concluded that response sets of 
this type play a larger part in personality 
testing than has been suspected. 


Received December 1, 1955. 
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Delaying Capacity, Fantasy, and Planning Ability: 
A Factorial Study of Some Basic Ego Functions 
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An increasing ability to delay gratification 
and to inhibit a direct motor approach to a 
desired object appears to be a necessary fea- 
ture in the development of an identifiable self 
in human personality. Intimately associated 
with this delaying capacity is the ability to 
resort to thought or ideation as “experimen- 
tal action” (8) in a behavioral medium freed 
of temporal and spatial boundaries. As early 
as 1900 Freud (7) had suggested a theoreti- 
cal linkage of delay and thought development 
and this relationship became a pivotal fea- 
ture of his concept of the reality principle 
and, later the ego. Similar theoretical and 
empirical associations of thought and motor 
inhibition or delay have also been described 
by psychologists and psychiatrists in diverse 
fields (13, 19, 21, 30). With the increased 
attention paid by psychoanalytic theorists to 
ego psychology and to functions such as plan- 
ning, synthesizing, and controlling motility 
(9, 11, 19), the possibility of integrating the 
psychoanalytic hypotheses with the concepts 
of psychologists involving the theoretical link 
between fantasy, language, and motor behav- 
ior emerges more clearly (22). The present 
study represents a further step in a program 
of research aimed at investigating some rela- 
tionships of thought and delay in ego struc- 
ture. 

The specific approach to this problem has 
been through the medium of an intensive 
study of the Human Movement response (/) 
on the Rorschach inkblots and its behavioral 


1 The authors are indebted to Drs. John F. Winne 
and Jacob Cohen who gave valuable assistance in 
the statistical analysis of the data and to the psy- 
chology staff, trainees, and ward personnel who par- 
ticipated in some of the evaluative procedures. 


correlates. Rorschach (20) pointed out that 
those persons who saw numerous human fig- 
ures in motion on the blots tended to be 
inhibited in their own motor activity and 
given to considerable imagination or fantasy, 
“movements in the mind’s eye.” In effect, 
therefore, Rorschach’s observations, and those 
of subsequent Rorschach workers (16, 18, 21) 
suggest that the M response may serve as a 
crude measure of delaying capacity and of 
imaginative tendencies which are basic ego- 
functions in Freud’s formulation. The experi- 
mental work on the vicarious functioning of 
sensory and motor patterns carried out by 
Werner and Wapner and their colleagues is 
consistent with this approach and has car- 
ried it a step toward more precise formula- 
tion in perceptual theory (30, 31, 32). A 
number of studies in the present series have 
also supported Rorschach’s observations by 
demonstrating that individuals whose Ror- 
schachs contain relatively more M responses 
are capable of greater motor inhibition in 
slow-writing tasks, move about less in a 
waiting-room situation, and tend to tell more 
imaginative stories to TAT pictures (14, 23, 
24). In the study to be presented here this 
work will be extended to include a large num- 
ber of tests and behavioral situations which 
presumably tap aspects of impulse control or 
delaying capacity, planfulness, and fantasy. 
The general problem raised is whether con- 
sistent individual patterns emerge along a 
dimension that links delaying capacity, fan- 
tasy, planning ability, and various measures 
of motility control. 

In approaching a concrete formulation of 
the general hypothesis a series of tests and 
behavioral indices were selected which, while 


375 








376 


superficially dissimilar, were expected to de- 
fine a common fantasy-motility factor. The 
tests included a general intelligence measure, 
employed as a control since it was felt neces- 
sary to determine the extent to which the 
dimension in question was related to intelli- 
gence. The Rorschach test was included, with 
Rorschach’s M response in particular ex- 
pected to yield a high loading on a fantasy- 
motility factor. In order to evaluate the gen- 
erality of the Human Movement phenomenon 
and to control for total number of Rorschach 
responses, Barron’s Movement Threshold Ink- 
blots (1) were included. Another estimate of 
fantasy tendencies was obtained from stories 
told by the Ss to four Thematic Apperception 
Test pictures scored for “transcendence,” i.e., 
introduction of characters, emotions, and 
temporal references which went beyond the 
mere description of the picture. Still another 
manifestation of fantasy tendencies charac- 
teristic of ego functioning might be an in- 
clination toward setting high goals. A meas- 
ure of level of aspiration was therefore in- 
cluded as well as a measure of sheer speed 
of performance in a simple motor task. The 
latter measure, however, would not be ex- 
pected to show a significant loading on the 
factor in question. Another aspect of the 
adaptive nature of a developed capacity for 
fantasy was considered to be the ability to 
admit to awareness a painful experience. Some 
qualitative results in a previous study (33) 
had suggested interesting individual differ- 
ences in admission of distress by Ss follow- 
ing a failure experience which had measur- 
ably interfered with intellectual performance. 
A crude measure of admission or denial of 
disturbance following failure was therefore 
included in the test battery with the hypothe- 
sis that it would carry an appreciable loading 
on a fantasy factor. A somewhat similar ra- 
tionale led to the inclusion of a technique 
which was felt to measure compliance with 
the examiner’s wishes and persistence in a 
routine task to please another person despite 
psychological satiation. Finally, some attempt 
was made to obtain a measure of planning 
ability or foresight since this seems so clearly 
a characteristic of a well-developed fantasy- 
life. One approach involved self-ratings on a 
simple scale devised by Small (26). The other 
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called for use of the Porteus Mazes, long de- 
scribed as measures of foresight by their 
originator (17), a claim afforded at least 
some support in a recent study by Small 
(26). The Porteus Mazes would appear par- 
ticularly relevant since success in their exe- 
cution involves the capacity both to delay 
motor reactivity, curbing impulsive solutions, 
as well as the more ideational component of 
visually moving through the mazes in ad- 
vance of the pencil. 

Turning now to tasks and situations involv- 
ing delaying capacity and control of motility, 
one possibility that suggested itself was a 
time-estimation procedure. Since the large 
majority of Ss tend to respond too soon when 
instructed to indicate when a stated time in- 
terval has elapsed, sheer accuracy gave an 
adequate measure of delaying ability and 
control of impusive reaction. A better meas- 
ure might have been to consider excessive 
delaying, but this occurred too rarely in prior 
trials of the procedure. A measure of motor 
inhibition, a slow-writing task originally de- 
vised by Downey, was included. A number 
of previous studies had suggested a consist- 
ently positive relationship between the ability 
to delay writing tempo and Rorschach’s M 
response (14, 23, 24). To obtain some evalua- 
tion of longitudinal gross motor tendencies, 
ratings on certain variables were obtained 
from ward personnel who knew the Ss for at 
least three months. Four variables based on 
the ward behavior items drawn from the 
Multidimensional Rating Scales (12) were 
employed. One involved aggressive behavior, 
frequently acting out a hostile impulse either 
through physical assaults or verbal impreca- 
tions. Another, cooperativeness with ward rou- 
tine, presumably an example of controlled 
motility, would be expected to show a posi- 
tive loading on a fantasy-motility factor. A 
measure of external interests, as manifested 
by behavior demonstrating some measure of 
orientation toward the hospital activities pro- 
gram and other people, was included. Finally, 
a series of items dealing with diffuse energy 
expenditure, gross motor activity not evi- 
dently hostile in intent, was employed, which 
might be expected to yield a negative or zero 
loading on a fantasy-motility factor. 

In summary, then, the specific form of the 











Delaying Capacity, Fantasy, and Planning Ability 


investigation involved the prediction that a 
battery of procedures which on the surface 
measured a wide variety of performances 
ranging from fantasy capacity as seen in 
storytelling, perception of humans in activity 
on inkblots, planning in a maze test, slow- 
writing, and controlled motility, would all be 
revealed as aspects of an underlying, more 
general dimension in which the capacity for 
imaginal living and control of impulsive mo- 
tility were linked. Since clinical experience 
and some previous research had suggested the 
value of Rorschach’s Human Movement re- 
sponse as a measure of both the fantasy and 
motility aspects of the assumed underlying 
factor, it was anticipated that Rorschach’s M 
response and the threshold for Human Move- 
ment on Barron’s inkblots would show espe- 
cially high loadings on this fantasy-motility 
factor. 


Method 


Subjects. There were 100 Ss, all male vet- 
eran patients bearing the diagnosis of schizo- 
phrenia. They were chosen chiefly on the 
basis of their having recently received a Ror- 
schach examination and an intelligence test. 
Their average age was approximately 31, 
mean date of first hospitalization 1949, mean 
IQ 108, and their years of education aver- 
aged 10.5. Social class background for this 
group was predominantly lower-middle and 
upper-lower and their ethnic backgrounds ap- 
proximated the general hospital picture. Al- 
though all the patients in this study were un- 
questionably psychotic with hallucinations or 
delusional trends, only those Ss who proved 
to be in sufficient contact to follow instruc- 
tions for all of the tests were employed in 
the sample. It should be noted that the pa- 
tients in this study represented a different 
sample of Ss from those used in two prior in- 
vestigations in this series (23, 24). 

Procedure. A battery of special procedures 
was organized with an instruction booklet, 
and a scoring sheet.? With the exception of 


2 Copies of the correlation matrix are available on 
request from the authors. Tables giving the inter- 
correlation matrix, the transformation matrix, in- 
structions for the various test techniques, and sam- 
ple data sheet have been deposited with the Ameri- 
can Documentation Institute. Order Document No. 
4968 from ADI Auxiliary Publications Project, Pho- 
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the Rorschach and Wechsler-Bellevue, which 
had usually been given some weeks or oc- 
casionally months earlier, all the data were 
obtained at one 90-minute sitting. The be- 
havior ratings were obtained shortly after 
testing by interviews with ward personnel. 
Much of the testing was carried out by two 
of the authors (J. L. S. and H. W.) with oc- 
casional administrations of the battery by 
clinical psychologist trainees who had re- 
ceived instruction in the testing procedures. 


Wechsler-Bellevue Intelligence Scale (Variable 1). 
For various practical reasons it was not possible to 
obtain full-scale IQs on all Ss, but a prorated IQ 
estimate was obtained. These estimates were based 
on verbal tests usually, and only occasionally were 
performance test IQs available. 

Rorschach. The Rorschachs were standardly ad- 
ministered examinations scored in accordance with 
Klopfer and Kelley (10). All protocols were re- 
checked by conference to eliminate the few ques- 
tionable scorings of the original examiners. The 
remaining test battery was administered by the in- 
vestigators without prior knowledge of Rorschach 
data for the Ss in order to eliminate unconscious 
influence by the examiner. All determinants were 
scored but only those with sufficient frequency of 
occurrence and range in the sample were employed 
in the analysis. The Rorschach scores with Variable 
Numbers were: Total Responses (R, var. 2), Hu- 
man Movement (M, var. 3), Animal Movement 
(FM, var. 4), Form-level (Beck’s F + %, var. 5), 
Black or Gray shading (Klopfer’s C’, Beck’s Y, var 
6), Form-Color (FC, var. 7), and Color-Form (CF, 
var. 8). 

Barron’s Movement Threshold Inkblots (Variable 
9). This test consists of thirty black blots presented 
one at a time for thirty seconds with usual Ror- 
schach instructions. Only one response per card is 
obtained and the score is based on the card in which 
the first clear M response is elicited. For purposes of 
simplifying the factor analysis, the scores on this 
variable were arbitrarily reflected prior to insertion 
in the matrix; a positive correlation in the matrix 
thus means that a low Movement Treshold is asso- 
ciated with a high score on the other variable. 

Thematic Apperception Test Transcendence Index 
(Variable 10). Four cards of the TAT were adminis- 
tered in accordance with instructions developed in 
an earlier study (24). The cards were 1, 6BM, 7BM, 
and 12BM. The Transcendence Index, adapted from 
work by Weisskopf (29) called for scoring of items 
that involved addition of material rot directly on 


toduplication Service, Library of Congress, Wash- 
ington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
payable to Chief, Photoduplication Service, Library 
of Congress. 
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the card, eg., mew characters, names, past or fu- 
ture events, etc. 

Planning Self-Rating (Variable 11). This simple 
six-point rating scale was used by Small (26) in a 
study of planning tests. It calls for S to rate himself 
along a scale of planfulness in which each point is 
defined. The points range from: “I never plan things 
in advance,” through “I always plan things out care- 
fully in advance.” 

Porteus Mazes (Variable 12). This test was given 
in accordance with instructions in Porteus’ Manual 
(17). Only the quantitative score (TQ) was em- 
ployed here although further analysis by qualita- 
tive means is being planned. 

Wechsler’s Number-Square Test (Level of Aspira- 
tion). This procedure, modified from a _ technique 
devised by David Wechsler, called for S to draw 
squares around numbers printed serially in rows of 
ten across a page. Instructions called for speed and 
30 seconds were allotted for a trial of 40 numbers. 
Prior to the first trial Ss were asked to estimate 
their expected performance given only the infor- 
mation that “people do between 18 and 30 in half-a- 
minute.” Following performance a second estimate 
was obtained. This procedure was carried out twice 
so that there were in all three expectation scores 
(one prior to performance) and two actual perform- 
ance scores. Measures included Initial Expectation 
score (£:, Var. 13) and Level of Aspiration E.+ Es 
—P,+P, (L.A. Var. 15). The score obtained on 
initial performance (P:, Var. 14) was also em- 
ployed as a measure of sheer speed of reaction. 

Time Estimation (Variable 16). This task called 
for S to indicate by saying “Stop” when he thought 
that a given time period had elapsed. Three times 
were used, 15, 30, and 60 seconds. Deviations from 
actual time were added to give a total deviation 
score. The measure used was Accuracy, based on 
absolute deviation. 

Block Authority-Reaction Test (Variable 17). 
This procedure modified from a technique devised 
by the Blocks (2), called for Ss to put ten pegs in 
holes and remove them at their own pace, repeating 
this procedure until their psychological satiation 
point was reached. The Ss were informed that they 
could stop at will, but when they did so E asked, 
“Wouldn’t you like to do some more?” and then, 
“Are you sure you'd like to stop now?” If Ss con- 
tinued the task in this ambiguous situation, they 
were scored for positive response to authority. 

Motor Inhibition Test (Variable 18). This tech- 
nique, derived originally from the Downey Will- 
Temperament Scale, had been used in a number of 
studies in this series (14, 23, 24). It called for S to 
write the phrase, “New Jersey Chamber of Com- 
merce,” as slowly as possible without stopping mo- 
tion of the pencil. The score was the time in seconds 
taken to complete the phrase. The difference between 
an initial trial at normal tempo and the experimen- 
tal trial was the actual measure. 

Digit Frustration, Admission of Disturbance (Vari- 
able 19). This task, employed originally in a study 
on frustration by Wilensky (33), involved straight- 
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forward digit retention except that S was taken be- 
yond his limit of success, and his errors on the digit 
series beyond his memory span pointed out. Follow- 
ing a standard number of failures, the digit spans 
were presented in decreasing length until a long pe- 
riod of sustained success occurred. Evidence of dis- 
turbance was based on the difference between initial 
level and postfrustration level of digit retention. The 
variable employed here was the admission or denial 
of disturbance following failure. This was a di- 
chotomous classification; those patients whose re- 
sponses were unintelligible or who actually refused 
to respond were not considered, and consequently 
the N for this variable was lower than for the others. 

Behavior Ratings. To obtain some evaluation of 
long-term motility patterns, items from the Multi- 
dimensional Rating Scales were grouped under four 
headings: those pertaining to Aggression (Var. 20), 
verbal or physical assaultive and destructive behav- 
ior on the ward, Cooperativeness (Var. 21) with 
ward routine, Interest Level (Var. 22), as seen in 
voluntary participation in ward activities, and Dif- 
fuse Energy Level (Var. 23), excessive motor ac- 
tivity not specifically hostile in intent. Ward per- 
sonnel, aides, nurses, physicians, and ward clinical 
psychologists, all of whom had daily contact with 
the patients in question as well as with the hospital’s 
daily listing of unusual behavior on each ward, were 
interviewed and each patient rated on a three-point 
scale for each item in the four categories. The rater 
was not cognizant of test results at the time of his 
interviews and ratings. 


Results and Statistical Analysis 


Following inspection of the various dis- 
tributions, the scores on each variable were 
dichotomized at the median. In almost all 
cases good splits were obtained, the excep- 
tions being the Authority-Reaction, where 
only 30% of Ss met the criterion of compli- 
ance and the Energy-Level variable, where 
the cut was hampered by an extremely re- 
stricted range of scores. Following dichoto- 
mization, phi coefficients were obtained and 
transformed to estimates of Pearson’s r in 
accordance with the procedure suggested by 
Edwards (5). A comparative sampling of tet- 
rachoric coefficients revealed no sizable dif- 
ferences in the coefficients obtained by the 
two procedures—most deviations being of the 
order of 0.01 to 0.03. Recognizing the limita- 
tions of rs obtained in this fashion, the au- 
thors are following Wittenborn’s reasoning 
(34) in attempting a factor-analytic study 
of Rorschach data. 

A cluster analysis had led to the elimina- 
tion of a number of variables which showed 
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Table 1 
Centroid Loadings, Communalities, and Common Factor Loadings 
Variables I II Ill IV h? A B c D 
1. IQ (W-B) 27 26 —06 —17 17 01 25 19 —12 
Rorschach Variable 
2. R 72 27 21 18 66 07 35 53 44 
3. M 82 —67 25 —Q2 1.18 81 10 —27 65 
4. FM 52 24 32 32 53 —07 27 52 52 
5. F+% 32 —i1 11 —17 16 22 21 —08 14 
6. C’ 25 55 —19 10 41 —19 14 57 —14 
7. FC 46 29 —16 33 43 07 —03 56 15 
8. CF 22 18 —16 28 18 02 —11 37 06 
Experimental Battery 
9. Mt 60 —47 14 33 71 56 —14 01 62 
10. TAT 58 27 30 06 50 —03 44 42 38 
11. PSR 38 28 08 —47 44 02 55 07 —13 
12. Port 45 —12 —28 —24 36 48 03 —07 —13 
13. E, 38 16 38 —52 58 —01 71 —06 09 
14. P,; 31 42 21 —41 48 —18 64 i8 —07 
15. LA 11 —30 13 —1i1 14 23 04 —27 15 
16. TIM 35 12 —18 12 18 17 —03 28 02 
17. AR 24 —16 39 30 33 04 07 11 57 
18. MIT 74 —25 —23 —10 67 69 00 00 12 
19. AD 45 —26 36 20 4 24 12 05 60 
20. Agg 19 36 —16 17 22 —11 02 44 -06 
21. Coop 19 —54 —26 — 34 52 4 —16 —53 —15 
22. Ex I 28 27 — 50 —49 64 23 16 04 —59 
23. En L 15 33 —29 07 22 —04 —03 35 —21 
negligible correlations in the matrix, e.g., Movement-Threshold show sizable loadings 


Age, Date of First Hospitalization, Education 
(which correlated chiefly with IQ). The re- 
sulting matrix of 23 variables was then sub- 
jected to a factor analysis, using Thurstone’s 
centroid method (27). Communalities were 
estimated by means of minor centroids. The 
relatively small residuals remaining in the 
third residual matrix and the unknown reli- 
ability of the method of estimating r war- 
ranted extraction of only four factors. The 
axes were rotated to oblique simple structure 
following Thurstone’s method of rotation by 
two-dimensional sections. The centroid fac- 
tors, the calculated communalities and ob- 
lique factor loadings are presented in Table 1. 
Since the oblique factors show slight inter- 
correlations, a second-order factor analysis 
was carried out. The matrix of intercorrela- 
tions of the factors, the second-order centroid 
factor loadings, and the rotated factor load- 
ings are presented in Table 2. 

Consideration of Factors A and D sug- 
gests at least partial support for the hypothe- 
sis of the study since Rorschach M and 


on the two factors in which measures of con- 
trolled motility and fantasy are also appreci- 
ably loaded. Factor A derives by far its high- 
est loadings from M, Movement-Threshold, 
Porteus Mazes, Motor Inhibition, and Coop- 
eration in Ward Behavior. It thus appears to 
reflect controlled motility primarily and it 
seems possible that it represents the kind of 
motor restraint that would lead to compli- 
ance and inhibition of impulses. Factor D, 
on the other hand, with its sizable loadings 
on M, Movement Threshold, Admission of 


Table 2 


Intercorrelations Among Factors, Second-Order 
Centroid Loadings, and Oblique 
Rotated Loadings 











Factor A BC D I Il # AA BB 
A 40 44 —15 68 —16 49 15 45 
B 12 —17 51 —39 41 —14 58 
Cc —22 58 29 42 50 00 
D —36 —19 17 —32 —01 
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Distress after Failure, Animal Movement, 
External Interests (negative loading), Con- 
tinuation of Task in Response to Authority, 
Rorschach Responses, and T'ranscendence on 
TAT stories, appears to represent an idea- 
tional or introspective dimension of person- 
ality, particularly since all of the tasks in- 
volve little or no gross motor activity. Factor 
B might well be termed an ambition or need- 
achievement factor since it combines high 
loadings on productivity measures (Perform- 
ance Speed, Rorschach Responses, and TAT 
Transcendence) with self-estimates of achieve- 
ment potential (Initial Expectation on Num- 
ber Square Test) and planfulness (Planning 
Self-Rating). Factor C with its high loading 
on Rorschach Color and Shading, Animal 
Movement, Rorschach Responses, Coopera- 
tion (Negative), Aggressive Ward Behavior, 
TAT Transcendence, and Energy Level in 
Ward Behavior, seems to represent a com- 
bination of productivity or drive with emo- 
tionality and uninhibited activity. The high 
loading of Rorschach C’ and FM would ap- 
pear to support certain views of Piotrowski 
(16), who has suggested that numerous C’ 
and FM responses in Rorschach records are 
indicative of acting-out of impulses and ani- 
mation, respectively. Factor C may well rep- 
resent some aspect of a pressure for outward 
direction of living, perhaps a flight into ac- 
tivity and environmental manipulation which 
is manifested both motorically (high Aggres- 
sion and Energy Level, low Cooperation) and 
ideationally (high TAT Transcendence and 
Rorschach Responses). It closely resembles 
a factor termed “Emotional Surgency” by 
Foster (6) * in a somewhat similar study of 
Rorschach data. 

In the second-order analysis, Factor AA 
has a high positive loading for Emotional 
Surgency and a moderate negative loading 
on Introspectiveness. This result suggests at 
least some validity to Rorschach’s original 
concept of the experience type. Factor BB 
is clearly defined by high loadings in Con- 


8 A recent paper by Foster (6) which came to the 
attention of the authors subsequent to the comple- 
tion of this study also describes a factor termed 
“emotional surgency” which has high loadings for 
Rorschach shading and color responses. 
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trolled Motility and Ambitiousness or Need- 
Achievement. 


Discussion 


The results of the present investigation, 
while grossly supporting the formulation ad- 
vanced at the outset, make it clear that, at 
least for this sample, the initial hypothesis of 
a fantasy-motility dimension was overly sim- 
plified. The fact that Rorschach’s Human 
Movement response showed significant cor- 
relations with almost all of the variables pre- 
sumably measuring fantasy, planning or de- 
laying ability, and motor inhibition is striking 
confirmation of the original formulation of 
Rorschach (20) concerning Human Move- 
ment and of subsequent interpretations of its 
theoretical meaning and behavioral implica- 
tions (14, 18, 22, 30). Although Rorschach 
M and Barron’s Movement-Threshold show 
high loadings on both Factors A and D, the 
emergence of two separate and relatively un- 
correlated factors, one primarily involving 
motor control and the other ideational ac- 
tivity, suggests a more complex situation 
than was hypothesized. It seems possible that 
at least two types of persons are likely to 
present Human Movement responses to ink- 
blots. One type consists of persons with con- 
siderable ability to restrain impulsive re- 
sponse and to plan ahead and cooperate for 
their own good in the numerous hospital 
situations involving waiting or complying 
with routine. The other group, while sensitive 
to the wishes of authority and perhaps willing 
to comply, is too withdrawn into introspec- 
tion and self-evaluation or autistic reverie to 
make effective use of their potentialities for 
effective motor control and planfulness. This 
speculative distinction suggests more exten- 
sive qualitative exploration of the nature of 
Movement responses, perhaps to determine 
whether differences in content or elaboration 
of the percept may account for the associa- 
tion of M responses with the two separate 
patterns of response. Inasmuch as the two 
Human Movement measures have high load- 
ings on both the Motor and Ideational fac- 
tors in this study, the theory linking move- 
ment perception with controlled motility and 
fantasy receives some support. It may well 
be that the split factors which have emerged 
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may be a consequence of the pathology of the 
patients and that study of a normal sample 
might clarify this issue considerably. Should 
fantasy-motility linkages emerge in other 
studies of this sort, the possibility of an op- 
erational formulation of the concept of ego 
functions may open important research ave- 
nues. At a theoretical level, it may become 
feasible to explore the nature of individual 
differences in thinking and motility as a func- 
tion of family constellation or cultural back- 
ground (25). From a practical standpoint the 
development of a battery of tests of the sort 
described here may afford an opportunity to 
improve the nature of individual psychologi- 
cal testing by the use of brief tests tapping 
separate aspects of a common function. Such 
a development has indeed been advocated by 
investigators like Zubin (35), who have been 
critical of the interpretative complexities in- 
herent in the use of global projective devices 
like the Rorschach. 

The two remaining factors which emerged 
in this study, while not anticipated, provide 
interesting possibilities for fruitful study as 
well. Emotional Surgency, with its loadings 
on ideational productivity, aggressive or dif- 
fuse motor behavior, and Rorschach Color 
and Shading, calls attention to the possibili- 
ties for exploring still another dimension of 
perception and action. It is possible that some 
element of lack of perceptual control (34) 
goes along with a concomitant inability to 
restrain motor behavior. This hypothesis 
would not sufficiently explain the high load- 
ing of Rorschach Animal Movement responses 
and of Form-dominated Color responses in 
this sample, however, and it remains to be 
seen whether further studies duplicate these 
results. Factor B, tentatively termed Ambi- 
tiousness or Need-Achievement, suggests the 
picture of individuals who feel compelled to 
assert that they are planful and can do well 
in a task they have not attempted before and 
press hard for such accomplishment. It may 
be important to note that IQ, which does not 
load appreciably on any of the factors, has 
its highest value in Factor B. Thus, at least 
for this sample of schizophrenics, it ap- 
pears that a personality dimension involving 
achievement drive and a need to appear plan- 


ful may be delineated and proves to be rela- 
tively independent of fantasy tendencies. 

A final question that may be raised in con- 
nection with these findings of the second- 
order analysis is whether the four factors 
bear any diagnostic psychopathological sig- 
nificance. One might, for example, speculate 
on a hierarchical ordering of Factors BB and 
AA relative to an ideal criterion of mental 
health (or well-integrated ego functioning). 
Thus, Factor BB with its pattern of planful- 
ness, motor control, and ward cooperative- 
ness as well as ambitiousness, might repre- 
sent a more effective level of functioning than 
Factor AA, where there is a suggestion of ex- 
cessive withdrawal (Low External Interests) 
and Emotional Surgency with its indication 
of aggressive behavior and uncooperativeness 
as well as diffuse motor activity. Since dura- 
tion of illness and florid symptomatology did 
not appear to differentiate between various 
subgroups in this sample, such an ordering of 
psychopathological tendencies seems less ten- 
able although it remains reasonable to ex- 
pect that an optimal association of fantasy 
and delaying capacity would lead to more 
effective living. For the present these results, 
like those of Barron (1), suggest that differ- 
ing styles or patterns of fantasy and motility 
behavior are indeed discriminable through the 
use of relatively simple psychological testing 
procedures. It seems clear that many of the 
problems raised by consideration of these re- 
sults are relevant for general personality the- 
ory and indicate the value of research that is 
not geared toward differentiating one dubious 
psychiatric nosological group from another, 
but rather toward clarifying the interrelations 
of certain key human functions as the basis 
for what may ultimately become a more 
meaningful and objective system of person- 
ality classification. That the delineation of 
functional personality dimensions is not purely 
speculative is suggested by the fact that Fac- 
tors A, D, and C here bear some reseniblance 
to factors reported by Foster (6) as “Delay” 
and “Emotional Surgency,” by Cattell (4) as 
“Nervous, alert reactivity” and “Cultured in- 
trospective self-control,” as well as by Thur- 
stone (28) as “Reflectiveness” and “Impul- 
sivity.” The work of Block and Martin (3) 
in delineating “Under-” and “Over-control- 
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ling” children seems also to bear directly on 
these factors. 


Summary and Conclusions 


A number of concepts drawn from the psy- 
choanalytic theory of thought development, 
Werner’s sensory-tonic theory of perception, 
and Rorschach’s work on the Human Move- 
ment response (M), have led to the postula- 
tion of a dimension of personality which in- 
volves a linkage of fantasy tendencies with 
control of motility and impulsive behavior. 
In order to test this hypothesis a battery 
of tests which presumably measured fantasy 
tendencies, level of aspiration, planning abil- 
ity, delaying capacity, motor inhibition and 
motility, and responsiveness in ward behav- 
ior was developed. The battery was adminis- 
tered to 100 male schizophrenic patients. A 
factor analysis yielded four centroid factors 
which were subjected to an oblique rotation 
to simple structure. The four rotated factors 
were tentatively designated as: 

1. Motor Inhibition and Planfulness (high 
loadings on Rorschach M, Movement-Thresh- 
old, Porteus Maze Test Quotient, Motor In- 
hibition Time, and Cooperativeness in ward 
behavior) ; 2. -mbitiousness or Need-Achieve- 
ment (high loadings on Initial Expectation 
in a level-of-aspiration situation, Initial Per- 
formance in the same task, Planning Self- 
Rating, Transcendence in TAT stories, and 
Rorschach Response Total); 3. Emotional 
Surgency (high loadings on Rorschach Re- 
sponses, Animal Movement, Shading and 
Color, TAT Transcendence, Aggressiveness 
on the ward, Uncooperativeness, and Diffuse 
Motor Activity); 4. Introspectiveness or In- 
troversiveness (high loadings on Rorschach 
M, Animal Movement Responses, Movement 
Threshold, TAT Transcendence, Admission of 
Distress following Failure, Continuation of 
Task in Compliance with Authority, and 
Lack of External Interests in Ward Be- 
havior). 

The high loadings of Movement-threshold 
and Rorschach’s M on both the Motor In- 
hibition and the Introspective Factors sug- 
gest partial support for the original hypothe- 
sis but the emergence of two separate factors 
for motility and fantasy activity indicates 
that the original formulation was overly sim- 
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plified. The Emotional Surgency factor af- 
fords some support to Piotrowski’s analysis 
of the behavioral implications of Rorschach 
Shading and Animal Movement responses. 
Two second-order factors, one linking Emo- 
tional Surgency and lack of Introspection and 
the other Motor Inhibition and Ambitious- 
ness also emerged. 


Received December 8, 1955. 
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The Development of a Short Form of the 
Manifest Anxiety Scale’ 


A. W. Bendig 


University of Pittsburgh 


Two studies (1, 2) have indicated that a 
majority of the 50 items on Taylor’s Manifest 
Anxiety Scale (MAS) are not valid in pre- 
dicting clinical criteria of manifest anxiety. 
The consistency in item validity results of 
independent studies suggests that a short- 
ened form of the MAS, retaining only the 
valid items, might be more useful and clini- 
cally valid than the standard MAS. However, 
such a shortened MAS might lack the desir- 
able normative characteristics, such as high 
reliability, of the longer MAS. 

The 20 most consistently valid MAS items 
in the reports noted above (Item 1-16 and 
29-32 in 2, p. 360) were selected as a short- 
ened form of the MAS. The 50-item MAS 
was administered to 744 college Ss and their 
papers scored for the 20-item scale. The 20- 
item scale, omitting the 30 “nonvalid” items, 
was given to an additional 324 Ss. No sig- 
nificant differences in scale means or vari- 
ances were found for the two methods of 
administration, nor were any sex differences 


1An extended report of this study may be ob- 
tained without charge from A. W. Bendig, Dept. of 
Psychology, University of Pittsburgh, Pittsburgh 13, 
Pa., or for a fee from the American Documentation 
Institute. Order Document No. 4990 from ADI Aux- 
iliary Publications Project, Photoduplication Serv- 
ice, Library of Congress, Washington 25, D. C., 
remitting $1.25 for microfilm or $1.25 for photo- 
copies. Make checks payable to Chief, Photodupli- 
cation Service, Library of Congress. 


evident. A survey of studies using the 50- 
item MAS shows its median internal consist- 
ency reliability to be .82, while the similar 
reliability of the 20-item scale was .76. Three 
scores were obtained for 100 random Ss who 
had taken the 50-item form: (a) score on all 
50 items, (0) score on the 20 “valid” items, 
and (c) their scores on the 30 “nonvalid” 
items. The reliabilities of the three scores 
were: (a) .78, (6) .76, and (c) 48. The in- 
tercorrelations among the scores were: (a) 
and (b), .93; (a) and (c) .91; and (0) and 
(c), .70. 

It was concluded that the 20-item Pitts- 
burgh revision of the MAS (a) has eliminated 
from the standard MAS items of low internal 
consistency and validity; (4) provides scores 
that are about as reliable as the 50-item MAS 
and are highly related to scores on the stand- 
ard form; and (c) is more parsimonious of 
testing time and probably more valid than 
the longer MAS. 


Brief Report. 
Received June 25, 1956. 
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Perception of Apparent Motion in Organics 
and Schizophrenics 


Rayford T. Saucer 
VA Hospital, Perry Point, Maryland 


and Herdis L. Deabler 
Gulfport Division, VA Center, Biloxi, Mississippi 


Klebanoff, Singer, and Wilensky, in their 
article summarizing (3) recent findings con- 
cerning the psychological consequences of 
brain lesions and ablations, make the follow- 
ing statement: 


Perceptual research with brain-injured patients 
suggests that the repertory of the clinical psycholo- 
gist in the examination of neurological patients may 
soon be augmented by the use of specialized labora- 
tory methods such as tachistoscopic presentation, 
critical flicker fusion frequency testing, apparent 
movement thresholds, and other so-called “brass in- 
strument” techniques. The value of these perceptual 
techniques in localization of lesions remains equivo- 
cal. It seems clear, nevertheless, that subtle percep- 
tual impairments or reorganizations following cere- 
bral injury are most clearly brought out by intensive 
laboratory examination rather than by the widely 
used global clinical methods .. . (3, p. 11). 


The present paper is a report of an experi- 
ment in the perception of apparent motion, 
together with its theoretical bases and impli- 
cations, and attempts to show that as a re- 
sult of increased knowledge in this area, fu- 
ture research in it may lead to the develop- 
ment of a reasonably critical diagnostic in- 
strument. 

Werner and Thuma (10) found that brain- 
damaged children failed to report perception 
of tachistoscopic apparent motion. Relating 
the deficit to Goldstein’s theory of percep- 
tion, they attributed it to inability to organ- 
ize figure-ground relationships, which they 
held to be a fundamental principle of neuro- 
physiological organization. Shapiro (9) at- 
tributed the deficit to satiation in the cortex, 
equating the effect of continuous visual stimu- 


lation to the effect of organic damage, in ac- 
cordance with Kohler and Wallach’s (4) the- 
ory of satiation. 

Price and Deabler (6), in a study de- 
signed to extend Freeman and Josey’s (1) 
work on perception of the apparently moving 
afterimage induced by a rotating Archimedes 
spiral, report that 98 per cent of the known 
organic patients tested had difficulty in per- 
ceiving or failed to report this distinctive 
illusion. In discussing their results, they sug- 
gest that either a field theory or a theory of 
perception proposed by one of the writers 
may be used to explain their findings. 

Earlier work in the field of apparent mo- 
tion by one of the writers (7, 8) has led to 
the development of new techniques for the 
study of apparent motion. In the present 
study, apparent motion is produced by use 
of two alternately flashing light bulbs which 
are 1i-inch neon-filled, fluorescent coated 
Westinghouse Nite-Lites mounted on a verti- 
cal black board. These are illuminated alter- 
nately, with zero pause time between flashes, 
by a multivibrator square wave generator, the 
circuit of which is conventional. 

It is found that varying the rate of alter- 
nation of light flashes from 2 to 10 per second 
causes the light stimuli to appear in three 
phases which are directly related to the fre- 
quency of flashes. As the frequency increases, 
the stimuli may be perceived and described 
as (a) successive alternating appearance of 
each light; (5) apparent movement wherein 
the two lights seem to fuse into one light 
which appears to swing smoothly from side 
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to side; and (c) simultaneous appearance 
wherein the two lights are seen and perceived 
as flickering. The distinctive apparent move- 
ment phenomenon perceived in Phase 5 was 
observed by Wertheimer in 1912; he referred 
to it then as “optimal movement” (11). It 
has since been investigated by numerous psy- 
chologists (2, p. 899), being uniformly desig- 
nated as Beta movement, by which term it 
will be referred to in this study. 

If the light stimuli are made diffuse, with 
no sharply defined boundaries, such as is ac- 
complished by use of a ground-glass screen 
placed about one inch in front of the bulbs, 
then the last state, Phase c, may result in 
a phenomenon earlier reported (7) called 
Omega movement. Omega movement appears 
as movement of a dark, quasi-formless, amor- 
phous object. It has movement but differs 
from Beta in that it does not have the at- 
tributes of the primary stimulus. It has a 
minimal amount of “thing” quality and is 
frequently reported as “a dark shadow mov- 
ing back and forth.” 

The body of evidence indicates a deficit in 
organics in their ability to perceive apparent 
motion. It is felt that the deficit reported by 
Price and Deabler may be a phase of a gen- 
eral disturbance of the perception of appar- 
ent motion. The present study is designed to 
explore further the nature of this deficit. 

A theory of motion perception has been 
previously presented (7, 8). This theory 
states that motion perception is based on a 
matrix synthesis of perceptual organizations 
isomorphic or related to stimulus detail and 
to temporal aspects of the stimulus configura- 
tion. Beta motion, in which the apparently 
moving object has the attributes of the pri- 
mary stimulus, may be regarded as a syn- 
thesis of stimulus detail and motion. When 
the rate of alternation is increased from that 
which gives optimal Beta motion, the com- 
plex or synthesis breaks apart or dissociates. 
Perception of Beta movement is lost and the 
observer reports seeing either the two lights 
as stationary or the Omega movement phe- 
nomenon. Practically speaking, the threshold 
for loss of Beta is the same as for appearance 
of Omega. It is felt that the ability to sustain 
Beta perception at higher levels of complexity 
is dependent upon the functional efficiency of 
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the cerebral cortex. The threshold for loss of 
Beta perception may thus lend itself as a 
measuring index for determining cortical effi- 
ciency or impairment. 

It is to be expected that organics with 
their decreased cortical efficiency will have 
lower thresholds for loss of Beta or for ap- 
pearance of Omega than normals. Schizo- 
phrenics by operational definition and by 
their performance on most psychological tests 
are known to have low cortical efficiency also. 
They are included in this study to determine 
whether or not this is reflected in lower 
Omega threshold or whether this psychiatric 
group remains essentially normal with respect 
to ability to perceive apparent motion. 

It is the specific hypothesis of this study 
that as the rate of alternation of the two 
lights is changed, organics will have a lower 
experimental threshold for Omega or loss of 
Beta than normals, and it is highly probable 
that schizophrenics will also be lower than 
normals. 


Subjects and Method 


Four groups of 10 male subjects per group 
were used in this experiment, three of the 
groups being drawn from the patient popula- 
tion of a psychiatric hospital and one group 
from the population of nursing aides and at- 
tendants. The groups were designated as or- 
ganic, schizophrenic, lobotomized, and con- 
trol. 


Diagnosis of patients by the hospital staff was used 
as the criterion for grouping. The official diagnosis 
for each patient represented the opinion of psy- 
chiatrists, psychologists, and occasionally, psychiatric 
consultants. Patients were selected from the current 
case book of the Psychology Service and represented 
a random sampling of a hospital population. It is 
believed that they were typical for their particular 
diagnostic groups. 

Although many organic subjects had a history of 
one or more brain injuries, no particular category of 
organic involvement was favored. Alcoholic and 
senile syndromes were excluded. Practically all or- 
ganic patients tested carried a secondary or qualify- 
ing diagnosis of “with psychotic reaction.” 

Schizophrenics were taken from the category of 
schizophrenic reaction, undifferentiated type. If a his- 
tory of concussion damage, blackouts, or other dis- 
orders of consciousness was elicited, subjects were 
not used. All schizophrenic patients used in the ex- 
periment had experienced delusions or hallucinations 
at some period during their psychosis. Patients with 
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a history of ECT or ICT during the previous three 
months were not used. 

Ten Freeman-Watts lobotomy cases were also in- 
cluded in the study for comparative purposes. All 
were diagnosed chronic schizophrenics prior to the 
lobotomy. 

The controls were selected at random from male 
aides employed in the psychiatric hospital. They 
were used because it was felt that they represented 
a group of individuals of average intelligence and 
similar cultural background to the majority of the 
patients used. 


Subjects were brought into a darkened room 
and seated two meters from the light source. 
In a pilot study, it was found that a flash rate 
ef 2.5 flashes per second gave optimal Beta 
movement for most subjects. This rate was 
then used as a baseline for the experiment. 
The rate control was set at 2.5 f.p.s., and 
each subject was asked to report what he saw. 
Of the 40 subjects tested, two organic sub- 
jects failed to report Beta movement; of 
these, one reported Omega and another re- 
ported seeing no motion of any description. 
The flash rate was then raised in steps of 0.5 
f.p.s. every 30 seconds until the subject re- 
ported Omega movement. In a few cases the 
Beta organization was broken up by simul- 
taneous appearance of two fused lights, which 
represented a high threshold of dissociation. 
This point was equated to the perception of 
Omega motion (8). 


Results 


The rate at which Omega motion became 
visible or the rate at which the two lights 
appeared simultaneously was taken as the 
threshold of dissociation of Beta movement. 
The mean rates for the diagnostic groups and 
their associated standard deviations are given 
in Table 1. 

It can be seen from Table 1 that the means 
can be ranked from organic through normal. 
Analysis of the significance of the difference 


Table 1 
Threshold Means and Standard Deviations of Groups 








Schizo- Loboto- 





Measure Organics phrenics mies Control 
Means 3.58 3.96 4.45 4.52 
Sigmas 22 43 .22 14 
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Table 2 
Analysis of Variance 
Sum of 

Source Squares df F 
Total 19.19 39 
Between 5.94 3 5.50 
Within 13.26 36 





of the means is contingent upon the results 
of the analysis of variance and will be pre- 
sented in that order. 

Since a test for homogeneity of variance 
indicated that the variance of the groups rep- 
resents no more than chance distribution, 
analysis of variance for between and within 
categories was applied to the data. The re- 
sults are given in Table 2. 

For 3 and 36 degrees of freedom, an F 
ratio of 5.50 is significant beyond the .01 
level but not to the .001 level of significance. 
The null hypothesis, that these groups repre- 
sent random samples drawn from a common 
population, can be rejected at a satisfactory 
level of confidence. It may then be concluded 
that the threshold for the dissociation of Beta 
apparent motion is related to the presence of 
pathology. 

Following McNemar (5, pp. 74-75), the 
standard error of the difference can be com- 
puted by using the within-variance as the 
best estimate of the population variance. If 
this is done, the standard error of the differ- 
ence for the groups can be based on N — k 
degrees of freedom, where N is 40 and & is 4. 
Thus, 36 degrees of freedom are available for 
the various tests. The various tests are sum- 
marized in Table 3. 


Table 3 


Significance of Mean Differences 








Groups Compared t Significance 
Organic-Schizophrenic 1.5 NS 
Organic-Lobotomy 3.3 01 
Organic-Normal 3.5 O01 
Schizophrenic-Lobotomy 1.7 05 
Schizophrenic-Normal 2.1 025 
Lobotomy-Normal NS 
OS-LN 3.78 01 
OS-N 3.40 01 
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For four of the pairs, a ¢ which is not quite 
significant at the .001 level of significance but 
well beyond the .01 level can be established. 
Two are not significant and two are signifi- 
cant at the .05 level of confidence. Current 
statistical practice sets no level for rejecting 
the null hypothesis, although the writers feel 
that the confidence level should at least ex- 
ceed .01 before the null hypothesis is rejected. 
If we wish to consider these ratios as a CR 
(since 36 degrees of freedom are involved), 
then those ratios above 3.3 rise considerably 
above the .001 level of significance. The 
writers prefer the more conservative evalua- 
tion. 


Discussion 


The primary hypothesis that organics differ 
from normal or control subjects in Omega 
threshold has been established at a highly 
significant level of significance. Thus we may 
expect that as test procedure and instrumen- 
tation are refined, a diagnostic instrument 
may be evolved which will provide a state- 
ment of degree of pathology but not neces- 
sarily indicate the kind. Although schizo- 
phrenics are not distinguishable from or- 
ganics on the basis of this technique, the dis- 
tinction between normals and schizophrenics 
suggests that an apparent-motion perception 
technique may assist in differentiating these 
conditions. 

At the same time, significant F ratio indi- 
cates a significant nonchance order of mean 
scores. If the F ratio be taken as a limited 
case of a correlation coefficient, the Omega 
threshold is then correlated with some un- 
known function. This function is not neces- 
sarily the degree of behavioral deviation ex- 
hibited by the patient. The writers believe 
that the means represent points on a con- 
tinuum of cortical efficiency. 

If the results are examined in the light of 
a neural network frame of reference, they be- 
come meaningful. A matrix theory of percep- 
tion can be expanded to include the entire 
cerebral cortex as a single matrix. Any dam- 
age to any part of this matrix represents a 
loss and hence lowers the functional efficiency 
of the matrix. One of the major functions of 
the cortical matrix is to mediate contact be- 
tween the organism and the environment by 
ordering the information received through 
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sensory channels. The subjects selected as or- 
ganic demonstrate damage to the visual per- 
ceptive mechanisms regardless of the type of 
cortical damage. Evidently perception cannot 
be fractioned into visual or other components 
but is a global effort of the entire cortex. The 
continuum on which the subjects are ranked 
is the continuum of over-all efficiency of the 
cerebral cortex, not the efficiency of the visual 
pathways and visual projection areas alone. 

The organic subject exhibits deficit because 
of damage to neural processes. The schizo- 
phrenic, on the other hand, shows deficit be- 
cause of psychological factors which may or 
may not have a demonstrable isomorphic 
physiological basis. A large part of the cortex 
of the schizophrenic is involved in dealing 
with problems arising from emotions such as 
hostility toward a basically unfriendly envi- 
ronment. Neural matrices normally used in 
mediating environmental contact are dis- 
rupted by being organized into matrices con- 
cerned with problems which we may broadly 
define as emotional or as internal. Such neural 
mechanisms as are involved in these autistic 
processes must be subtracted from the sum- 
total amount of cortical tissue available to 
the organism for the mediation of contact 
with the environment. By a law of mass ac- 
tion, the results are not to be clearly sepa- 
rated from the results of organic damage. 

The interpretation of schizophrenic re- 
sponses may be inferred from the fact that 
when lobotomized patients are tested they 
fall toward the “normal” rather than the 
“deficit” end of the continuum. As has been 
mentioned, these patients were chronic schizo- 
phrenics unimproved by other therapies. 
When the subcortical tracts which mediate 
emotional responses to sensory material are 
out, they tend to return to the norm with re- 
spect to perceptual organization. As the tissue 
damage is mainly in subcortical areas, the 
functional Gestalt of the cortex is not de- 
stroyed. 

From the experimental evidence, it may be 
inferred that perception is a global process 
of the entire cortex and that by measuring 
the amount of organizational force available 
to the individual for use in perceptual proc- 
esses such as perception of apparent motion, 
the functional efficiency of the individual 
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may be measured and presence or absence of 
pathology determined. 


Summary 


Experiments reported by previous investi- 
gators indicated a deficit in the perception of 
apparent motion by organic individuals. The 
hypothesis that the deficit is due to lowered 
organizational ability has been advanced. The 
degree of organizational ability of the indi- 
vidual would be reflected in the threshold for 
dissociation of Beta apparent motion into 
component parts of detail and Omega move- 
ment. 

Organics, schizophrenics, lobotomy cases, 
and controls were examined for threshold of 
Beta dissociation. Analysis of variance indi- 
cates that the null hypothesis—that these 
four groups are random samples of a com- 
mon population with respect to the variable 
measured—can be rejected at beyond the .01 
level of significance. 

It is concluded that the threshold for loss 
of Beta apparent motion can differentiate be- 
tween certain clinical groups, and it is in- 
ferred that these differences are due to or- 
ganic or functional loss of organizing capacity 
of the cerebral cortex as a whole rendering it 
less able to synthesize sensory data. 


Received November 1, 1955. 
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Emotional and Intellectual Concomitants of 
Advanced Chronic Alcoholism 
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Boston Regional Office, Veterans Administration 


Much has been written concerning the ef- 
fects of excessive drinking on human mental 
processes. The chronic alcoholic has been 
variously labeled as psychotic, psychoneu- 
rotic, psychopathic, and intellectually de- 
teriorated. However, these beliefs have been 
based upon studies which have dealt with 
obviously biased samples drawn from such 
sources as alcoholics’ courts, mental hos- 
pitals, psychoanalysts’ case loads, and the 
prisons. 

A sample thought to be more representa- 
tive of chronic alcoholics in the general popu- 
lation was drawn from inpatients on the 
alcoholism ward of a large Eastern general 
hospital. A group of 120 males, virtually all 
the chronic alcoholics admitted for treatment 
during the prescribed period of this study, 
were examined. The source of patients was 
quite varied and there were many case his- 
tories of delerium tremens and visual and 
auditory hallucinosis associated with the use 
of alcohol. These patients had been drinking 
excessively for from 10 to 30 years, were 
acutely alcoholic shortly before admission, 
and were physically debilitated as a result of 
prolonged alcoholic excess. 


1 Formerly Senior Psychologist, Alcoholism Study 
Unit, Philadelphia General Hospital, Philadelphia, 
Pa. Gratitude is expressed for the aid given by Drs. 
Martin D. Kissen, Thomas Downs, and Albert Biele 
of the Philadelphia General Hospital. 

2An extended report of this study may be ob- 
tained from George A. Peters, Jr., 65 Manor Drive, 
Newark 6, New Jersey or for a fee from the Ameri- 
can Documentation Institute. Order Document No. 
4971 from the ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress, 
Washington 25, D. C., remitting in advance $1.75 
for microfilm or $2.50 for photocopies. Make checks 
payable to Chief, Photoduplication Service, Library 
of Congress. 


Administration of the Wechsler-Bellevue 
Intelligence Scale revealed a wide range of 
mental ability, little variation between the 
verbal and performance subtest scores, and 
a slightly higher mean score than found in 
the general population standardization data. 
(Full Scale mean 106.2, SD 13.89; Verbal 
mean 106.1, SD 14.21; Performance mean 
105.1, SD 13.72.) Academic accomplishments 
ranged from below the level of basic educa- 
tion to possession of a doctoral degree. The 
median school grade completed was the tenth 
grade. 

Diagnosis in accordance with the psycho- 
biologic nomenclature of the American Psy- 
chiatric Association indicated that these 
chronic alcoholics were rot primarily psy- 
chotic (8.3%), psychoneurotic (7.5%), nor 
sociopathic (5.0%). The number classified 
as chronic or acute brain syndrome (5.8%) 
indicates that only a small minority devel- 
oped clinical symptoms of central nervous 
system “organicity.” Over half (52.5%) were 
classified as manifesting symptoms indicative 
of a Personality Disorder. 

This group of patients suffering from the 
physical complications of advanced chronic 
alcoholism displayed no evidence of marked 
intellectual deterioration on the basis of gross 
IQ scores. The majority of cases were best 
described as manifesting a variety of person- 
ality disorders rather than psychoses, neu- 
roses, sociopathic, or brain syndromes. If this 
sample is used as a criterion, there is a sur- 
prising lack of major psychiatric disorder or 
restriction of intellectual capability in the 
advanced stages of chronic alcoholism. 


Brief Report. 
Received April 2, 1956. 
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Repression and Insight as Related to Reaction 
to Cartoons 


Seymour Epstein and Richard Smith 


University of Massachusetts 


A commonly accepted theory of humor is 
the Freudian view that repression favors an 
appreciation of humor in which the repressed 
material is represented (7). Although several 
studies on the nature of the humor response 
have been reported, most of which have en- 
dorsed Freud’s view, a critical appraisal of 
these studies indicates that they fail to deal 
directly with the Freudian hypothesis. What 
they do reveal is that (a) individuals show 
a consistent preference for certain types of 
humor (1, 3, 4, 5, 8, 12, 13), (6) preference 
for certain types of humor can be related to 
other personality variables (2, 4, 5, 9), and 
(c) preference for certain types of humor 
can be related to adjustment (12). 

In order to evaluate the Freudian hypothe- 
sis it would be desirable to obtain a measure 
of preference for a certain type of humor, to 
secure an independent measure of repression, 
and to determine whether there is a signifi- 
cant relationship between the two. This is 
what the present study proposes to do. 


Method 


Hostility was selected as the variable to be 
investigated because, as a socially disap- 
proved trait, it is subject to repression and 
there is a considerable amount of humor de- 
voted to it. In order to obtain a measure of 
preference for hostile humor, 16 hostile and 
16 nonhostile cartoons were obtained by se- 
lecting only those unanimously agreed upon 
by independent decisions from five judges. 
The cartoons finally selected were assigned 
random numbers and presented to each S in- 
dividually with two sets of instructions. First 
S was to rate each cartoon by placing it into 
one of three bins according to whether it was 


“funny,” “mediocre,” or “not funny.” As S$ 
observed each cartoon, E (examiner) unob- 
trusively recorded on a checklist whether S’s 
expressive reaction was: (@) a negative verbal 
expression, (5) a negative facial expression, 
(c) no reaction, (d) a positive facial expres- 
sion, (e€) a positive verbal expression. After S 
had completed sorting the cartoons, E shuffled 
them and instructed S to rearrange them in a 
Q sort by placing the funniest cartoons far- 
thest to the right and the least funny farthest 
to the left. The Q sort contained seven col- 
umns with successive frequencies 2, 4, 6, 8, 
6, 4, 2. The Q sort differed from the initial 
ratings in that it required that each cartoon 
be judged relative to all other cartoons. 
Establishing a measure of repression was 
more difficult than measuring reaction to 
humor. According to Freud, “The essence of 
repression lies simply in the function of re- 
jecting and keeping something out of con- 
sciousness” (7, p. 86). Accordingly, it would 
seem reasonable to assume that repressed im- 
pulses would achieve expression in everyday 
behavior to a greater extent than they would 
be consciously acknowledged. Consequently, 
an operational measure of repressed hostility 
was established as the degree to which a per- 
son denied the hostility attributed to him by 
a consensus of persons who knew him over an 
extended period. A measure of insight was 
obtained by considering the absolute dis- 
crepancy without regard to direction (2, 11). 
Thirty-two fraternity members who had 
known each other for at least a year served 
as Ss. A week after having sorted the car- 
toons, each S was again seen individually and 
presented with 32 cards containing the names 
of all Ss, including his own. It was requested 
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that the cards be placed in a Q sort such that 
the person who most approached the follow- 
ing description would be farthest to the right: 
“This person is an individual who tends to 
be inconsiderate, antagonistic, and resentful. 
When talking about others he is apt to em- 
phasize their faults although he does not like 
this done to himself. He has little sympathy 
for others and enjoys putting them on the 
spot. He makes disparaging remarks about 
people and argues for the sake of argument. 
It is important for him to come out on top 
and he is apt to be a sore loser in compe- 
tition.” This definition was made to be rele- 
vant to the manifestations of hostility that 
were apt to occur in group living. The Q sort 
consisted of seven columns with successive 
frequencies of 2, 4, 6, 8, 6, 4, 2. 


Results 


An index of repression and insight was ob- 
tained by subtracting from the self-rating of 
hostility on the Q sort, the average rating by 
others. Inspection revealed that there were 
nine individuals whose self-ratings were higher 
than their average rating by others. The Ss, 
therefore, were divided into three groups of 
nine each: (a) a group that overestimated 
its hostility and was moderately lacking in 
insight, with a mean self-minus-others score 
of 1.03, (6) a group that very slightly un- 
derestimated its hostility and was relatively 
insightful, with a mean of — 0.36, and (c) 
a group that considerably underestimated its 
hostility and was most lacking in insight, 
with a mean of — 1.72. For simplicity, the 
groups will hereafter be referred to as Re- 
pression Groups and attention will be turned 
to insight only when necessary. Differences 
between the groups were evaluated by analy- 
sis of variance of the measures of expressive 
reaction, cartoon rating, and cartoon Q sort. 
As there was some danger that the measures 
of insight and repression were confounded 
with absolute level of hostility, the groups 
were compared on mean hostility ratings by a 
simple between-within analysis of variance. 
An F of less than one was obtained indicating 
that any differences in over-all hostility were 
well within chance expectancy. 

In scoring expressive reactions, a weight of 
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one was given to facial expression and a 
weight of two to verbal expression. An analy- 
sis of variance was then performed which al- 
lowed for an evaluation of the following: 
Repression Groups (R), Subjects in same 
group (S), Favorable vs. unfavorable reac- 
tion (F), Hostile vs. control cartoons (C), 
and the interactions RX F, RXC, RXF 
x C, SXF, and § XC. In terms of the 
hypothesis under investigation, the critical 
source of variance is R X F X C, which indi- 
cates whether Repression Groups differ in fa- 
vorableness of response to hostile relative to 
control cartoons. This interaction did not ap- 
proach significance, yielding an F of less than 
one. A second source of variance concerning 
Repression Groups is the interaction R x C, 
which indicates whether the groups differ in 
amount of reaction, irrespective of kind, to 
hostile relative to control cartoons. Here, too, 
the F was less than one, suggesting a lack 
of relationship. The only source of variance 
relating to Repression Groups which ap- 
proached significance was R, which indicates 
total amount of expressive reaction. The F of 
2.68 was below the 3.40 required for signifi- 
cance at the .05 level, but did suggest a trend 
which is borne out by further findings. The 
order of the means was 9.69 for the group 
that moderately overestimated its hostility, 
4.94 for the group that slightly underesti- 
mated its hostility, and 5.11 for the group 
that considerably underestimated its hostility. 
Considering that the last two means are al- 
most identical, it appears that the group that 
overestimated its hostility tended to be more 
expressive than the other two. Additional 
findings from the analysis of variance are as 
follows: (a) There were highly reliable indi- 
vidual differences (.001 level) both in amount 
and favorableness of expression, (5) Favor- 
able expression occurred significantly more 
often than unfavorable expression (.001 
level), and (c) The hostile cartoons elicited 
more favorable reactions than the control 
cartoons (.01 level). 

The data from S’s ratings of the cartoons 
were analyzed in the same manner as ex- 
pressive reactions. Favorable and unfavorable 
ratings were summated independently, the 
“mediocre” rating receiving a weight of zero. 
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Repression Groups were found to differ sig- 
nificantly (.01 level) on total amount of re- 
ac ‘on in a manner confirming the nonsignifi- 
cant tendency noted for expressive behavior. 
The only other significant source of variance 
(.001 level) indicated that the hostile car- 
toons were rated more favorably than the 
control cartoons. 

The data from the Q sort were evaluated 
by assigning weights of one to seven to the 
successive columns into which the 32 car- 
toons had been sorted. Scores consisted of 
the weight received on each of the 16 hostile 
cartoons. Analysis of variance revealed that 
Repression Groups did not differ significantly 
either in their reaction to the pooled hostile 
cartoons or in their evaluation of the indi- 
vidual hostile cartoons. The only statistically 
significant finding (.001 level) was that some 
of the hostile cartoons were judged funnier 
than others. Despite the lack of a reliable re- 
lationship between repression and reaction to 
individual cartoons, the data were inspected 
for leads for further work. It was observed 
that the cartoon which was ranked first in 
funniness by the group that considerably un- 
derestimated its hostility was ranked sixth 
by the group that slightly understimated its 
hostility and ninth by the group that over- 
estimated its hostility. Thus, preference for 
this cartoon was directly related to denial 
of hostility. The cartoon was one by Virgil 
Partch depicting a wife plunging a revolving 
egg-beater into the back of her husband, and 
saying, “Stick ‘em up!” It was £’s impres- 
sion, confirmed by independent decisions 
from three other judges, that this cartoon 
was the most straightforwardly hostile in the 
sample. Unfortunately, only two similar car- 
toons of the “practical joke” variety depict- 
ing an unprovoked act of hostility by an 
adult could be found. In both cases a direct 
relationship was indicated between repression 
and preference for the cartoon. 

In order to determine whether it is neces- 
sary to take into account level of funniness 
of the cartoons, two separate analyses of 
variance of Q-sort ratings were done, one 
based on the four least funny, the other on 
the four most funny hostile and control car- 
toons as determined by the mean rating of 
the cartoon. On the least funny cartoons Re- 


pression Groups did not differ in preference 
for hostile relative to control cartoons but 
did differ in over-all rating of the pooled car- 
toons (.05 level). The order of the means was 
29.33 for the group that considerably under- 
estimated its hostility, 25.11 for the accurate 
group that very slightly underestimated its 
hostility, and 26.00 for the group that con- 
siderably overestimated its hostility. Thus, 
the group that evaluated itself most accu- 
rately judged the poor cartoons most appro- 
priately, and the reverse holds for the group 
that evaluated itself most inaccurately. On 
the most funny cartoon, Repression Groups 
did not differ significantly in their reaction 
to the pooled cartoons, but the order of the 
means for the same three groups was, respec- 
tively, 36.44, 39.56, and 37.00, which again 
suggested that accuracy of self-evaluation was 
directly related to appropriateness of reaction 
to the cartoons. Thus, taken together the 
findings on the least and most funny cartoons 
indicated a direct relationship between in- 
sight and sense of humor. 

In order to determine the relationship be- 
tween the three measures of reaction to hos- 
tile cartoons, Pearson product-moment cor- 
relations were computed. To do this, the 
measure of expressive reaction was expressed 
as a ratio of favorable reactions to hostile 
cartoons over favorable reactions to all car- 
toons. The same index was used for cartoon- 
ratings. For the Q sort, the score was simply 
the total weight on the hostile cartoons. The 
following correlations were obtained: expres- 
sive reaction and cartoon-rating, .57; expres- 
sive reaction and Q sort, .52; and cartoon- 
rating and Q sort, .79. All correlations are 
significant beyond the .01 level (¢ test, V = 
32). These results indicate a significant de- 
gree of reliability which is at least .52 for 
expressive reaction and .79 for cartoon-rat- 
ing and for Q sort. That the measure of ex- 
pressive reaction correlated lowest with the 
other two may indicate that it has the lowest 
reliability or that it measured something 
unique, or both. There is some evidence in 
support of the latter possibility in that ex- 
pressive reaction was the only measure on 
which significant individual differences were 
found. 
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Discussion 





Repression and humor. The results of this 
study fail to support the hypothesis that re- 
pression of hostility is related to a general 
preference for hostile cartoons. Negative re- 
sults are difficult to interpret as there is no 
way of knowing whether different operational 
measures or different populations would have 
produced different findings. In defense of the 
measure of reaction to cartoons used, ade- 
quate reliability for group differentiation was 
indicated. Moreover, validity was somewhat 
supported by the finding that reaction to car- 
toons was meaningfully related to insight. 
The measure of repression is somewhat more 
questionable, but it, too, was meaningfully re- 
lated to another variable in that it was in- 
versely associated with over-all expressive- 
ness. This relationship reveals a degree of 
reliability beyond chance, although there is 
no indication of how high it is. However, 
since the Q technique for measuring repres- 
sion required each S$ to judge and be judged 
by all other Ss relative to each other, an ac- 
ceptable degree of reliability would be ex- 
pected, at least approaching the minimum 
estimate of .79 obtained from the same Q 
sort for cartoons. Considering that all F 
values on the interaction of repression and 
preference for hostile cartoons were much less 
than one, there is no reason to suspect that 
higher reliability would have altered the find- 
ings. That another operational measure of 
repression might have produced positive find- 
ings remains to be demonstrated. Some ques- 
tion might be raised as to whether a group 
that overestimates its hostility should be con- 
sidered as low in repression. If only the ac- 
curate group and the group that underrated 
its hostility are compared, the results are 
equally negative. 

One explanation of the negative findings on 
repression and humor is that there is no gen- 
eral relationship between repressed hostility 
and preference for hostile cartoons. That 
there might be a relationship between re- 
pressed hostility and a preference for certain 
types of hostile cartoons was suggested by 
the results on a cartoon of the practical-joke 
type which depicted an unprovoked act of 
hostility. However, the evidence here was in- 
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clusive and verification is required. A second 
possibility is that a relationship would be 
found if suppression (conscious inhibition) 
or external frustration of an impulse were in- 
vestigated rather than repression (uncon- 
scious inhibition). This, too, remains to be 
determined, but seems likely in view of com- 
mon observations of the interest in sexual 
jokes of sexually frustrated groups such as 
adolescents (culturally frustrated) and men 
in the armed services (geographically frus- 
trated), who are not necessarily repressed. A 
third possibility for investigation is that a re- 
lationship between repression and humor be- 
comes manifest only when repression reaches 
pathological proportions (10). 

Insight and humor. Allport states, “. . . 
the most striking correlate of insight is the 
sense of humor” (2, p. 222). He cites one 
study in which a correlation of .88 was found 
between ratings on insight and ratings on 
sense of humor. In the present study it was 
found that there is a statistically significant 
relationship between rating the cartoons cor- 
rectly and judging one’s hostility accurately. 
Why are insight and humor related? Although 
there was no way of testing this in the pres- 
ent study, it is hypothesized that the two are 
related only when humor involves an ability 
to accept the self as its object. Presumably 
the person who is able to laugh at his limita- 
tions should be low in defensiveness and ac- 
cordingly should be insightful. It would be 
interesting to develop a measure of differen- 
tial appreciation of humor in which the ob- 
ject of some of the jokes or cartoons is self- 
related and of others is other-related. This 
measure might then be related to measures 
of defensiveness and insight. Perhaps a subtle 
clinical instrument for determining defensive- 
ness and insight would result. 

An incidental finding of some interest was 
that there are individual differences in degree 
and type of expressive reaction irrespective 
of the content of the cartoon responded to. 
This finding indicates that it is necessary to 
use control cartoons in evaluating differences 
in expressive reaction to any particular type 
of cartoon. It further indicates that individual 
differences in expressive behavior, apart from 
the nature of the cartoons responded to, is it- 
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self a promising personality variable for in- 
vestigation. 
Summary 

Preference for hostile humor was measured 
by three techniques, which included observ- 
ing S’s expressive reactions to hostile and 
control cartoons, having S rate the cartoons, 
and having him sort them on a Q sort of 
funniness. In order to obtain a measure of 
repression, the Ss, members of the same fra- 
ternity for at least a year, rated themselves 
and each other on a Q sort of hostility. The 
degree to which the self-rating was lower than 
the average rating by others was taken as an 
index of repression; the absolute discrepancy 
between the self-rating and rating by others 
was taken as an index of insight. Three groups 
of nine each were selected, a group that con- 
siderably overestimated its hostility, an accu- 
rate group that very slightly underestimated 
its hostility, and a group that considerably 
underestimated its hostility. Following are 
the major findings: 


1. No relationship was found between re- 
pression and general preference for hostile 
cartoons, but there was questionable evidence 
that repression was related to preference for 
a practical-joke type of cartoon depicting an 
unprovoked act of hostility. 

2. A significant positive relationship be- 
tween insight and sense of humor was found. 

3. A significant negative relationship be- 
tween repression and degree of reactivity was 
found. 

4. Highly reliable individual differences in 
degree and kind of expressive reaction irre- 
spective of the cartoons responded to were 
found. 


Implications for further work on the rela- 
tionship of humor to repression and insight 
were discussed. 
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Actual-Ideal Discrepancies 
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Recent studies by Rogers (4) and his asso- 
ciates have revealed that when subjects (Ss) 
are asked to do Q sorts of self-referent items 
under instructions to sort them so that they 
describe themselves as they are at present 
and then again so that they describe their 
concept of the ideal person, the correlation 
between these two sorts seems to vary with 
the presence or absence, or severity, of mal- 
adjustment. The size of the correlation be- 
tween actual- and ideal-self-sorts has been 
interpreted variously as a measure of self- 
esteem or self-acceptance, and the finding 
that persons classified as maladjusted have 
lower Q correijations than persons not so 
classified has been taken as evidence for the 
importance of self-acceptance in adjustment 
and the value of the self concept as a con- 
struct in personality theory. Other investi- 
gators have suggested defining personal mal- 
adjustment “as any discrepancy between the 
concept of self and the concept of ideal self” 
(1, p. 257). 

However, upon careful analysis of the pro- 
cedures and data involved in these experi- 
ments it does not appear that these infer- 
ences are entirely justified. In this paper we 
propose to examine the logical justification 
for these inferences and then to present data 
supporting an alternate hypothesis. 

The task set for the S in the typical actual- 
ideal Q-sort study is one of describing one 
aspect of his environment as he perceives it 
in its actual state and again as he perceives 
. its ideal state to be. The resulting correlation 
may be interpreted as an indication of the 


1The author wishes to thank Mr. Richard F. 
Gundy who ran the Ss and performed the statistical 
computations. 


perceived discrepancy between the actual and 
ideal states of the particular aspect of the en- 
vironment delineated for the S$ by the ex- 
perimenter’s instructions and the descriptive 
items he has to work with. In the studies re- 
ferred to above the aspect of the environment 
involved is labeled “the self” and a set of 
verbal items deemed by the experimenter as 
appropriate in describing the self function to 
delineate that aspect of his total environment 
for which the S is to indicate his perceived 
actual-ideal discrepancy. 

The finding that a certain portion of the 
variance of the discrepancy measure may be 
attributable to individual differences does not 
in and of itself permit any inferences con- 
cerning the significance of the particular as- 
pect of the environment used in obtaining 
this discrepancy measure. That is, until it 
has been demonstrated that actual-ideal cor- 
relations vary with adjustment if and only if 
they are obtained through the use of the self 
as the object of description, inferences con- 
cerning the importance of the nature of the 
self concept in personality functioning are 
not justified. It may be, for example, that the 
more maladjusted the individual the greater 
the discrepancy he perceives between the 
actual and ideal state of affairs over a wide 
range of areas of his environment. Thus, con- 
ceivably, one might discover a significant re- 
lationship between Q correlations for actual- 
ideal descriptions of geographical climate and 
personality adjustment. Unless one went on 
to sample other aspects of the S’s environ- 
ment as objects of description he might be 
led to construct a theory of personality func- 
tioning with climatological perception as a 
central construct. 
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The hypothesis which we propose to test 
in the present study is that individuals differ 
in the extent to which they perceive discrep- 
ancies between the actual and ideal states of 
various aspects of their environment and that 
these differences are fairly constant from one 
area to another. Thus an individual obtaining 
a high Q correlation when doing actual-ideal 
sorts of self-referent items would be expected 
to obtain a high Q correlation when doing 
actual-ideal sorts of items descriptive of other 
aspects of his environment, and similarly 
with an individual obtaining a low Q correla- 
tion between actual-ideal self-sorts. 


Procedure 


The Ss used in this experiment consisted of 
21 volunteers (2 females and 19 males) from 
an introductory psychology course, none of 
whom were aware of the nature of the ex- 
periment at the time of their volunteering 
for it. 

Each S was seen individually and was asked 
to do actual-ideal sorts of 100 self-referent 
items and of 100 home-town-referent items. 
The self-referent items were those developed 
by Butler and Haigh (2).? The home-town- 
referent items were collected in the follow- 
ing manner. Twenty students in the author’s 
personality class were given the following in- 
structions: “(a@) List five things that you like 
most about your home town. (6) List five 
things that you like least about your home 
town. (c) List five things which you feel 
would be true of the ideal home town.” From 
the resulting pool of 300 items, 100 were se- 
lected which appeared to have fairly wide ap- 
plicability and which provided an approxi- 
mately equal representation of positive and 
negative statements. 

The Ss were required to sort the state- 
ments for each sort into nine piles with the 
following distribution of cards per pile: 1, 4, 
11, 21, 26, 21, 11, 4, 1. Each S was first pre- 
sented with the self-referent statements and 
asked to sort them so as to describe himself 
as he sees himself at present from those least 
like himself to those most like himself. Fol- 
lowing this he was asked to sort the home- 


2 The author wishes to thank Dr. Rosalind F. Dy- 
mond for making these items available for this study. 


town-referent statements so as to describe his 
own home town. Next he was asked to sort 
the self-referent statements again so as to de- 
scribe his ideal person, or the person he would 
most like to be. And finally, he was asked to 
sort the home-town-referent statements so as 
to describe his concept of the ideal home 
town, the kind of town he would most like to 
live in. The same order of presentation was 
used with all Ss. 

The two Q correlations resulting for each S 
were then correlated over all Ss in order to 
determine the extent to which actual-ideal 
discrepancies in the two areas sampled were 
related. 


Results and Discussion 


The median Q correlation for actual-ideal 
self-descriptions was .47 with a range from 
.08 to 82. The median Q correlation for 
actual-ideal home-town descriptions was .24 
with a range from — .28 to .92. The Pearson 
correlation between the two sets of Q correla- 
tions was .70 which is significant at the .01 
level. This finding makes the null hypothesis 
untenable and gives some support to the ex- 
perimental hypothesis that the Q correlation 
obtained for an individual on actual-ideal 
self-descriptions represents a tendency to per- 
ceive actual-ideal discrepancies which tran- 
scends the particular aspect of the S’s envi- 
ronment used as the object of description. 
Furthermore, in the light of the present data, 
previous hypotheses concerning actual-ideal 
discrepancies which treated them as phe- 
nomena contingent upon such conditions as 
self-acceptance, self-esteem, self-satisfaction, 
etc., appear to be too narrowly conceived. 

While the self concept as an important con- 
struct may still be salvaged by invoking the 
processes of projection and rationalization on 
the part of the S to account for generalized 
perceived actual-ideal discrepancies, particu- 
larly in the case of maladjusted individuals, 
such reasoning must be recognized as ad hoc. 

In the present study only two aspects of 
the Ss’ environment were used as objects of 
description. Before any definite statement can 
be made about the generality of perceived 
actual-ideal discrepancies many more aspects 
would need to be sampled. It would appear 
however that research might be profitably di- 
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rected toward a study of the nature and ex- 
tent of perceived actual-ideal discrepancies. 
Certain rather interesting research prob- 
lems become opened up by such an approach 
and it may be of some value to sketch these 
briefly. One such problem would be: Given a 
certain number of objects of description, do 
individuals vary in the consistency of their 
perceived actual-ideal discrepancy over the 
range of these objects? What factors might 
best account for the variance in this con- 
sistency? What factors, if any, appear to be 
a function of the degree of perceived actual- 
ideal discrepancy in any given area of de- 
scription? Do these vary from area to area? 
The theoretical implications of such research 
are difficult to anticipate but it seems fairly 
clear that current self-oriented neophenome- 
nological personality theories (3, 5) would 
not be adequate to account for these data. 


Summary 


The logical justification for current inter- 
pretations of actual-ideal self-description dis- 
crepancies was called into question. An ex- 
periment was performed in which Ss did 
actual-ideal Q sorts using self-referent items 
and using home-town-referent items. The re- 


sulting sets of correlations were found to have 
a statistically significant correlation of .70 be- 
tween them and this was taken as evidence 
indicating a certain amount of generality to 
perceived actual-ideal discrepancies and fur- 
ther substantiating the criticism of current 
interpretations of these discrepancies when 
limited to the self as the object of descrip- 
tion. Implications of these findings for future 
research were sketched briefly. 


Received December 5, 1955. 
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The view has been current for some years 
that the period soon after electroconvulsive 
shock offers unique opportunities for psycho- 
therapy with mental patients. Though many 
clinicians have used the period rather exten- 
sively for this purpose, others have ques- 
tioned its value. For instance, Kalinowsky 
and Hoch write, “In our experience, psycho- 
therapy during convulsive therapy has often 
been impossible because of the patient’s con- 
fusion” (3, p. 118). 

How valuable is the postshock period for 
psychotherapeutic purposes? If it is valuable, 
how soon after shock should psychotherapy 
be instituted? Answers to these questions will 
depend, among other things, on an analysis 
of the mental state of the patient after dis- 
organizing shock, and a determination of the 
time pattern by which abilities are recovered. 
A number of research efforts have been di- 
rected toward this end, most of which have 
concentrated on memory functioning. Certain 
investigators (1, 5, 6) have reported a de- 
cided interference of shock with memory. Ex- 
perimental testing has usually been done sev- 
eral hours or more after shock, rather than 
immediately afterward. An example of work 
in the immediate postshock period, however, 
is that of Lunn and Trolle (4), who studied 
mental abilities 10, 30, and 60 minutes after 
shock. They note very poor retention for ma- 
terials presented shortly after shock. Thirty 
minutes afterward, however, they report that 
85 per cent of their patients were able to give 
“a satisfactory description of the content of 
a picture and at the same time express a com- 


1 The cooperation of Ott B. McAtee, M.D., Super- 
intendent, William E. Murray, M.D., Asst. Super- 
intendent, and the staff of Madison State Hospital 
in making their facilities available for this study is 
hereby gratefully acknowledged. 


prehension of the point of the picture” (4, p. 
376). 


The Problem 


Up to this time, relatively little research 
attention has been given to one of the crucial 
factors in the ability to profit from psycho- 
therapy—namely, learning. This is especially 
true of learning in the immediate postshock 
period, when psychotherapy is often insti- 
tuted. As a partial answer to this lack of in- 
formation, the present research was designed 
to study, under controlled conditions, the 
pattern for recovery of learning ability after 
shock. 

One possible, and no doubt partial, inter- 
pretation of the therapeutic effect of convul- 
sive shock is that it temporarily disrupts or 
disorganizes abnormal mental patterns and 
permits the substitution of different patterns 
of a more normal or healthy character. We 
are well aware of the fact that a startling 
event can shock a normal person “out of his 
complacency” and cause him to see familiar 
things in new ways. It may be that convulsive 
shock does somewhat the same thing for the 
abnormal person, and jolts him out of his 
“psychotic rut.” If this line of thought be 
followed, then it would appear that the opti- 
mum time to substitute new patterns of 
thought would be at the point where disor- 
ganization is great enough to permit their in- 
troduction and yet where learning ability is 
sufficiently recovered to permit their reten- 
tion. In other words, if the above interpreta- 
tion has any validity, the best time for a re- 
educative therapy would be as soon after 
shock as a reasonable degree of learning 
ability has returned. 

Subjects. Subjects in this investigation were 
five female and three male patients at Madi- 
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son State Hospital who were receiving elec- 
tric convulsive therapy. They were all liter- 
ate, in relatively good contact, and coopera- 
tive. Their ages ranged from 27 to 62, the 
average being 37.1. Six were schizophrenic, 
one manic-depressive, and one involutional. 

Experimental materials. Three types of 
learning materials were developed and used 
—pictorial, verbal, and numerical. To test 
visual object learning, a series of 16 line 
drawings was prepared. Each drawing con- 
tained a mannikin surrounded by certain ob- 
jects. Fifteen details were catalogued in each 
picture, such as type of hat, tie, etc. on the 
mannikin, and house, balloon, and other ob- 
jects surrounding the mannikin. Five differ- 
ent objects or conditions were systematically 
varied at each of the 15 locations on the dif- 
ferent pictures. For instance, the objects on 
the right side of the mannikin were a bicycle, 
a clock, a suitcase, a tree, and a truck; and 
the mouth of the mannikin was shown vari- 
ously as open, straight, corners up, corners 
down, and with one corner up and the other 
down. The subject was shown a picture and 
each detail was pointed out and verbally 
called to his attention with the request that 
he remember as many as possible. Each pic- 
ture also contained a different mental hy- 
giene aphorism having eight words and five 
ideas, which was attributed to the mannikin. 
An example was, “Interests outside make a 
person far happier inside.” Patient motiva- 
tion for the task was good and several seemed 
to enjoy it. Indeed, success with this pro- 
cedure would seem to dictate its wider use as 
a measure of learning ability. 

To test verbal learning, 16 lists of paired 
adjectives were developed. Each list contained 
11 pairs. They were obtained from a list of 
300 pairs developed by Melton and Safier 
(2), and were relatively equivalent in terms 
of the similarity ratings given them by 96 
university students. A list was slowly read 
over once to a subject with the instruction 
that he should try to remember as many of 
the pairs as possible. Thereupon he was 
handed a typewritten list containing the first 
adjective in each pair ranged on the left 
margin, and on its right the second member 
imbedded among three other synonymous 
adjectives. For instance, the pair Pleasing- 


Charming, appeared on the typed list given 
to him as follows: Pleasing: Pleasant, Gra- 
cious, Charming, Winning. The subject was 
asked, “Which word did I read with Pleas- 
ing?” The same procedure was employed with 
each of the other pairs. 

To investigate numerical learning, a series 
of 16 lists of numbers was prepared. Each 
list contained two samples each of two-digit 
through eight-digit numbers, arranged as in 
the Wechsler-Bellevue [Intelligence Scale. 
Starting with the shorter numbers, subjects 
were asked to repeat each one immediately 
after it was read to him. His score was the 
highest number he could accurately repeat. 
No attempt was made to go further when 
both samples of a given length were failed. 

Procedure. Subjects were enlisted by ex- 
plaining to them that they had been chosen 
for a series of memory tests. One practice 
trial was given on each of the three types of 
materials to familiarize the subject with the 
procedure. The order of presentation was 
picture, paired adjectives, and numbers. 


A control series of tests was run with each subject 
one to three days before shock was begun. Runs of 
the three types of tests were given six different times. 
Some hours to a day intervened between the first 
three and the last three runs. A short break of a 
minute or so was introduced between runs. 

Experimental testing was begun as soon after 
shock as the patient was able to get up and go to 
a nearby examining room. Two runs of tests were 
given in succession. A third run was given after the 
patient had had lunch. This procedure was followed 
after three shocks—in practically all cases the first 
three. This made a total of 9 experimental runs as 
compared to 6 control runs. Runs were rotated so 
that four subjects got the test series in the order 1 
to 15, and the remaining four in the reverse order. 

The time for beginning of experimental testing 
after shock varied from 10 to 30 minutes, with an 
average for all subjects of 184 minutes. Several pa- 
tients became nauseated, showed confusional retarda- 
tion, or displayed paranoid resistance. All these 
caused testing delay. 


Results 


Table 1 shows clearly the effect of shock 
on learning performance. A profound inter- 
ference with learning ability is shown in the 
first test run after shock, which was an aver- 
age of 184 minutes later. The critical ratios 
of differences between experimental and total 
control means are highly significant except in 
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Table 1 
Scores on Control and Experimental Series 








Control runs 




















Test and measure 1 2 3 Total 1 2 3 
15 Mannikin Items 
Mean 10.8 11.3 11.4 11.2 2.5 8.3 12.2 
Percentage of success 72 75 76 75 17 55 81 
SD 2.21 2.96 2.87 2.21 
CR with control total 12.39 4.23 1.82 
5-Point Mannikin Aphorisms 
Mean 3.6 2.7 3.2 3.2 A 1.8 3.4 
Percentage of success 72 54 64 64 8 36 
SD 1.87 1.26 1.87 1.73 
CR with control total 7.26 2.89 
11 Paired Adjectives 
Mean 7.6 7.3 8.6 7.8 4.5 5.5 7.7 
Percentage of success 69 66 78 71 41 50 70 
SD 2.47 1.66 2.22 2.03 
CR with control total 6.76 3.99 -— 
8 Numbers 
Mean 6.1 6.1 6.0 6.0 5.9 6.0 6.5 
Percentage of success 76 76 75 75 74 75 81 
SD 


CR with control total 





the case of numerical learning. Interference is 
noted in less but still significant degree on 
the second experimental run, except in the 
case of numerical learning. The second test- 
ing run began an average of 38% minutes 
later. Learning performance has returned to 
preshock levels by the third testing run, 
which was an average of 2% hours later. 


Discussion of Results 


It is clear from these results that electric 
shock profoundly interferes with learning 
abilities. Greater interference is shown for 
abstract than for more concrete materials. 
The most extensive interference occurred in 
the case of the aphorisms, which were the 
most abstract materials used in this experi- 
ment. During the first test run after shock, 
only two persons of the eight could remem- 
ber any part of the aphorisms. One of these 
two could not be tested until 30 minutes 
after shock. The least interference occurred 
in the case of numerical learning and report. 
This task represented the lowest degree of 
abstraction. Hetherington (1) also found 


93 1.38 1.23 1.04 


little influence of shock on immediate num- 
ber recall. 

Learning was measured in this experiment 
by means of immediate report. No attempt 
was made to determine whether materials 
were retained over a longer period of time. 
In a re-educative therapy program, long-term 
retention would be necessary, of course. 

So far as learning ability is concerned, it 
would appear highly unprofitable to attempt 
.a re-educative type of therapy immediately 
after shock. This is deduced from individual 
as well as group scores. While individual 
variations in recovery rate were smoothed by 
combining them to produce a group picture, 
all individual patterns of recovery closely 
paralleled the group pattern. 

By two to three hours after shock, results 
show that learning ability has returned to 
preshock levels. It is quite probable that con- 
trol level ability may return even earlier. 
Timing for the third test run was set at the 
completion of the lunch hour for patients 
and so this was not determined. The shortest 
period for beginning the testing of any pa- 
tient on the third run of tests was 1 hour and 
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55 minutes after shock. In this one case learn- 
ing ability was back to preshock level. 

Insofar as present results are concerned, 
re-educative therapy is contraindicated for 
most patients during the first hour after 
shock. Recovery rates varied for different pa- 
tients, but preshock learning performance is 
recovered somewhere between 1 hour and 24 
hours after shock. According to the theory 
enunciated earlier in this article, somewhere 
within this interval would seem to be the 
optimum time for the institution of re-edu- 
cative efforts. 


Summary 


Using pictorial, verbal, and numerical ma- 
terials, the learning ability of eight mental 
patients was tested both before and at stated 
intervals immediately after administration of 
electroconvulsive shock. Results showed a 
profound interference of shock with learning 
ability. Interference was greater for abstract 
than for more concrete learning materials. 
Learning abilities were recovered in a gradual 
fashion, with preshock level being restored 
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not later than two to three hours after shock. 
Implications of the results for a re-educative 
type of therapy are noted. 


Received January 3, 1956. 


References 


1. Hetherington, R. Psychological changes due to 
E.C.T. as measured by a battery of tests 
given repeatedly. J. ment. Sci., 1952, 98, 334- 
341. 

2. Hilgard, E. R. Methods and procedures in the 
study of learning. In S. S. Stevens (Ed.), 
Handbook of experimental psychology. New 
York: Wiley, 1951. Pp. 517-567. 

3. Kalinowsky, L. B., & Hoch, P. H. Shock treat- 
ments, psychosurgery. (2nd Ed.) New York: 
Grune & Stratton, 1952. 

4. Lewis, N. D. C. Psychiatry. In R. P. Mackay, 
N. D. C. Lewis, & P. Bailey (Eds.), The 1949 
year book of neurology, psychiatry, and neu- 
rosurgery. Chicago: Year Book Publishers, 
1950. Pp. 209-423. 

5. Stone, C. P. Losses and gains in cognitive func- 
tions as related to electro-convulsive shocks. 
J. abnorm. soc. Psychol., 1947, 42, 206-214. 

6. Zubin, J. Objective studies of disordered persons. 
In T. G. Andrews (Ed.), Methods of psy- 
chology. New York: Wiley, 1948. 


ee 





Journal o, Con Psychology 
Vol. 20, No. 5, 1956 


Note on “Changes in Psychoneurotic Patients 
With and Without Psychotherapy” 


Desmond S. Cartwright 


University of Chicago 


A recent paper in this Journal by Barron 
and Leary (2) reports a comparison of 
changes in a treatment group of psychoneu- 
rotic patients with changes in a nontreatment 
group. Groups were matched for type and 
severity of disorder, prognosis, socioeconomic 
variables, and interval of time between pre- 
tests and posttests. 

Of the patients in treatment, 42 received 
individual and 85 received group psycho- 
therapy. The measures employed were MMPI 
scales, including a prognostic scale (Es) de- 
veloped by Barron (1). The crucial data 
showed mean differences in scale scores be- 
tween pretests and posttests. Comparison of 
these data by critical ratio analysis led to the 
conclusion that: “The therapy patients did 
not improve significantly more than did the 
waiting-list controls” (2, p. 245). 

This conclusion answered the question: 

. is there significantly more change with 
regular and formally defined psychotherapy 
than without it?” (2, p. 245). Statistically, 
the answer given states that the treatment 
and nontreatment samples must be considered 
to be drawn from the same parent population 
in regard to mean changes. The presence or 
absence of psychotherapy makes no difference. 

However, from the data presented (2, p. 
243), it appears that the samples differ as 
to the dispersion of changes (SDs). Table 1 
presents the relevant data for the individual 
psychotherapy group and the nontreatment 
group, together with F ratios computed from 
the variances. 

Table 1 suggests with high probability that 
the two samples are not drawn from the same 
parent population in regard to the variance of 
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changes on scales F, Hs, D, Pt, Sc, and Es. 
This means that the presence or absence of 
individual psychotherapy makes a consider- 
able difference. 

The implication of the results in Table 1 is 
that in fact there was “significantly more 
change with regular and formally defined 
psychotherapy than without it.” 


Table 1 


Variances of Discrepancy Scores on MMPI Scales 
for Individual Psychotherapy and 
Nontreatment Groups 














Individual Nontreatment 
psychotherapy group 

(N =42) (N =23) 
MMPI 
Scale Ls V F p 
1. Lie 19.89 23.43 1.18 — 
2. F 215.21 22.94 9.38 01 
3. K 55.95 31.70 1.76 -— 
4. Hs 127.46 64.16 1.99 05 
5. D 244.30 93.32 2.62 01 
6. Hy 113.21 87.80 1.29 — 
7. Pd 155.00 89.68 1.73 — 
8. Pa 111.94 68.06 1.64 — 
9. Pt 208.51 73.27 2.85 01 
10. Se 272.91 74.13 3.68 01 
11. Ma 126.79 75.34 1.68 — 
12. Es 43.56 14.82 2.94 01 





*V Veo pou from SD data reported by Barron and 
Leary (2, p. 243). 


The greater amount of change, however, 
was not all in one direction. Thus the au- 
thors’ conclusion that “The therapy patients 
did not improve significantly more than did 
the waiting-list controls” is correct but in- 
sufficient. For many scales the variance re- 
sults suggest that mean differences between 
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the groups are absent because differences of 
two kinds, opposed in sign, are present. It 
seems that some therapy patients deterior- 
ated to a greater extent than did the waiting- 
list controls, while some therapy patients did 
improve significantly more than the controls. 


Received November 9, 1955. 
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A Short Form of the Wechsler-Bellevue and 
the Army Classification Battery as 
Measures of Intelligence’ 


Donald B. Murphy 


Army Medical Service School, Ft. Sam Houston, Texas 


and Robert D. Langston 
Br. U. S. Disciplinary Barracks, Ft. Crowder, Mo. 


Although the Army Classification Battery 
and short forms of the Wechsler-Bellevue 
have been used as measures of intelligence, 
a review of the literature indicated the ap- 
plication of previously published short forms 
to an Army disciplinary barracks popula- 
tion was somewhat questionable. The pres- 
ent study was designed to investigate the va- 
lidity of a short form of the W-B and of 
Aptitude Area I (AAI) of the Army Classifi- 
cation Battery as measures of intelligence for 
this population. 

For the present study, the records of 515 
consecutive releasees from the Br. U. S. Dis- 
ciplinary Barracks, Ft. Crowder, Missouri, 
were reviewed. The ages of this group of male 
prisoners ranged from 16 to 41 with the mode 
falling at the 20-24 age grouping. Eighty-five 
per cent worked at unskilled labor or had no 
occupation prior to induction. Negroes com- 
prised 30% of the group. 

The W-B full scale IQ, short form IQ 
based on the Information, Comprehension, 
and Block Design subtests, and the scores on 
the AAI, which has a normative population 
mean of 100 and SD of 20, were obtained 
for each case. 


1An extended report of this study may be ob- 
tained without charge from Capt. Donald B. Murphy, 
AMSS, Ft. Sam Houston, Tex., or for a fee from 
the American Documentation Institute. To obtain it 
from the latter source, order Document No. 4970 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
ton 25, D. C., remitting in advance $1.25 for micro- 
film or $1.25 for photocopies. Make checks payable 
to Chief, Photoduplication Service, Library of Con- 
gress. 


The mean of the full scale W-B was 93.2 
with an SD of 16.1; the mean of the short 
form was 92.8 with an SD of 16.9; and the 
mean of the AAI was 81.7 with an SD of 
22.2. The short form of the W-B and the 
AAI correlated .91 and .83, respectively, with 
the full scale W-B. 

The advantage of the short form, since it 
is a reliable predictor of full scale scores, is 
that testing time is reduced, permitting rapid 
processing and more time for evaluation of 
nonintellectual factors. 

The Ss used in this study had a signifi- 
cantly lower IQ (p < .01) than Wechsler’s 
normative group and differed from Wechsler’s 
population with respect to occupational status, 
education, and race. It is interesting to note 
that the short form used in this study has 
been reported to have the highest correlation 
with the full scale W-B for Wechsler’s norma- 
tive group, thus indicating the stability of 
these subtests despite differences in socioeco- 
nomic, educational, and racial backgrounds. 

The mean for the AAI was significantly 
lower (p < .01) than the AAI mean (90.7) 
estimated on the basis of the Ss’ W-B scores. 
Marked increase in the variability of AAI 
scores below 75 suggested that many of the 
low scores resulted from poor motivation at 
the time the test was administered rather 
than from low intelligence. When AAI scores 
below 75 were discarded, the mean of the 
AAI scores (93.4) closely approximated the 
estimated AAI mean and correlated .88 with 
the full scale W-B. 


Brief Report. 
Received April 3, 1956. 
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“Reliability of the Blacky Test”: 
A Reply to Charen 


Gerald S. Blum 


University of Michigan 


After reading Charen’s brief report on the 
retesting of 25 tuberculous patients, printed 
in the February issue of this Journal (5), I 
was left with two major misgivings: (a) the 
title is a misnomer inasmuch as the technique 
was used improperly; and (0) the author’s 
reasoning is faulty. I shall elaborate these 
points below: 

1. The Blacky Pictures are designed to 
yield conflict scores for a series of psycho- 
sexual dimensions. As stated in the Manual 
(3), four separate sources contribute to each 
dimensional score: spontaneous story, inquiry 
items, related comments, and cartoon prefer- 
ences. The underlying conception of reliabil- 
ity, upon which the instrument was con- 
structed, assumes that these sources tap the 
conflict area at varying levels and in com- 
bination provide the best possible estimate 
of conflict intensity. The revised scoring sys- 
tem (4) referred to by Charen makes clear 
this necessity for including all four sources in 
deriving a reliable dimensional score. Instead, 
Charen arbitrarily singled out for investiga- 
tion one source, multiple-choice items from 
the inquiry, which is stated to be closest to 
consciousness and, therefore, most easily dis- 
torted by the subject. Also the technique was 
administered incorrectly, since spontaneous 
stories were not obtained prior to the re- 
sponses to multiple-choice items. 

2. From the facts that the Rorschach and 
paper-and-pencil tests indicated no “basic 
changes” in personality whereas the Blacky 
items did show some (basic?) changes, 
Charen infers only two possible alternatives: 
“Either the Blacky questions tapped person- 
ality levels not reached by the other tests or 
did not have good reliability.” He rejects the 
first alternative by a statement of faith in the 
Rorschach. I prefer to call upon evidence con- 


tained in a study by Berlow (2), who com- 
pared Blacky scores (in a sample of patients 
whose responses had previously been mean- 
ingfully related to theories of paranoid schizo- 
phrenia [1]) to psychosexual indicators on 
the Rorschach. In addition to using judges to 
assess psychosexuality from the Rorschach, 
Berlow conducted a systematic and exhaus- 
tive analysis of signs, patterns, and content. 
No significant relationships to the Blacky 
were established. Had Charen employed the 
Blacky correctly, there would still be no rea- 
son to expect congruence with Rorschach 
findings. Berlow’s results are not introduced 
to set off a debate on the relative merits of 
the two instruments, but simply to point out 
that there is no place for the Rorschach in 
Blacky’s orbit of construct validation. 

Finally it is important to keep in mind that 
even when the technique is administered and 
utilized correctly, the retest method, reflect- 
ing as it does an unknown mixture of changes 
in set, familiarity with items, and perhaps 
personality changes as well, cannot properly 
be employed to study reliability of responses 
to the Blacky Pictures. 


Received March 23, 1956. 
Early Publication by the Editor. 
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A Reply to Blum 


Sol Charen 
Montgomery County Mental Health Clinic, Rockville, Maryland 


Blum pinpoints emphasis on my title to 
suggest that the technique I used is incorrect. 
A title is only intended to furnish brief infor- 
mation about an article. Mine could have 
been clearer. It is also necessary to read 
the article to find out what it is about and 
not rely solely on the title. Clearly stated 
throughout my brief report, “The Reliability 
of the Blacky Test,” is the fact that it deals 
with the low reliability of the multiple-choice 
questions of the Blacky test, as found in my 
doctoral research (2). This is all my article 
deals with, not the total Blacky test, and 
Blum is unwittingly in accord with my text 
when he states in his reply “. . . multiple- 
choice items from the inquiry, .. . [are] 
closest to consciousness and, therefore, most 
easily distorted by the subject.” As he points 
out, though, the multiple-choice questions do 
furnish one of the four sources which con- 
tribute to each dimensional score. The ques- 
tion of reliability of the total Blacky test is 
therefore involved when a major score source 
may be questionable. 

Since spontaneous stories were not obtained 
from my chronically ill patient subjects I did 
not follow his technique. How this omission 
affected their responses to the multiple ques- 
tions for each cartoon I do not know. But 
Blum is on shaky ground when he implies 
that this omission somehow affected the reli- 
ability since he has no reliability studies of 
his own for either the total Blacky test or any 
of its four scoring sources. Without such data, 
his assumption is purely hypothetical that the 
omission of the spontaneous stories somehow 
affected the reliability in the test-retest 
situation. 

Blum has lifted a sentence out of context. 
Nowhere did I use the Rorschach test to 
validate either the Blacky test or its multi- 
ple-choice items. The-Rorschach and fifteen 


paper-and-pencil tests were used to evaluate 
whether my patients in a four-month period 
showed test evidence of basic personality 
changes as a result of chronic illness and ex- 
tensive hospitalization. No test evidence of 
such changes was found. Only the multiple- 
choice items of the Blacky test indicated that 
some changes took place in my experimental 
subjects. These, in general, suggested they 
were nonregressed during illness testing and 
were regressed when retested shortly before 
discharge from the hospital. I believe I am 
consequently justified in assuming that the 
equivocal results found in the Blacky multi- 
ple-choice items may have been due to the 
factor of the low reliability which was found 
to exist. 

One final comment about the last para- 
graph of Blum’s reply is in order. The ques- 
tion of set should not be involved in the 
Blacky test as it might be in other person- 
ality tests since his scoring allows for evasive- 
ness or attempts to use defenses. Familiarity 
with items is not likely to be a problem when 
a four-month or longer period elapses be- 
tween the test and retest. Finally, since the 
Blacky test purports to measure, “the deeper 
recesses of personality” (1), namely the ge- 
netic psychosexual dimensions which should 
not change in the individual unless psycho- 
analysis or a marked life experience occurs, 
personality change is also not a factor which 
should prevent use of the retest method. 


Received June 25, 1956. 
Early publication by the editor. 
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Doll, Edgar A. Vineland Social Maturity Scale. Min- 
neapolis: Educational Test Bureau, 1947. 


Reviewed by 


Morris Krugman 
Board of Education, New York, N. Y. 


An instrument that has been in use for thirty 
years is hardly news. The first article about the 
Vineland Social Maturity Scale by Edgar Doll ap- 
peared in 1935, after almost ten years of use and 
development at the Vineland Training School; the 
last revision was made in 1936, and the manual ac- 
companying the current form of the scale was pub- 
lished in 1947. Even a casual inspection of psycho- 
logical literature reveals more than a hundred titles 
discussing or mentioning use of the scale, and many 
more are undoubtedly available. 

Something new, however, has been added, with 
the publication of The Measurement of Social Com- 
petence by Edgar A. Doll in 1953. For the first time 
in thirty years, the rationale, the details of the 
standardization process, scoring samples, methods of 
interpretation, and other pertinent data usually sup- 
plied when psychological instruments are first pre- 
sented for general use, are made available for the 
Vineland Social Maturity Scale. Although the brief 
1947 manual mentions the care with which the 
standardization process was carried out, any real 
idea of the meticulousness of this process can be 
gained only from the voluminous 1953 manual. Each 
of the 117 items of the scale is described in minute 
detail with possible variations, normative curves are 
presented for each item, and means, medians, SDs 
and CRs supplied. Line drawings illustrating typical 
behavior are added. One wonders whether the treat- 
ment of data was overelaborated, since the norms 
are based on ten males and ten females at each age 
level from 0-1 to 30, a total of 620 subjects. 

The scale is too well known to require detailed de- 
scription here. Briefly, it consists of 117 items, scaled 
in order of difficulty and arranged by years from 0 
to 30. The items are distributed among six categories: 
self-help (general, eating, and dressing), self-direc- 
tion, occupation, communication, locomotion, and 
socialization. “Each item is conceived as representing 
a general growth in social responsibility which is ex- 
pressed in some detailed performance as an overt 
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expression of that responsibility. Consequently, the 
value of the detailed items is to be determined prin- 
cipally by the extent to which they reflect this per- 
sonal independence in personal activities, in respect 
to which the detailed performances are otherwise 
relatively unimportant.” The items “reflect progres- 
sive freedom from need of assistance, direction, or 
supervision on the part of others.” 

Although direct rating by the examiner is not 
ruled out if opportunity for observing or knowing 
the subject is available, the usual method of use is 
by interviewing an informant who is thoroughly fa- 
miliar with the behavior of the subject. The fact that 
the subject being examined and rated need not be 
present or observed is both an advantage and a dis- 
advantage. The obvious advantage is the case of ap- 
plying the scale by questioning a generally mature, 
competent adult about an immature, incompetent, or 
handicapped individual. The disadvantage lies in the 
possibility of gross distortion and incorrect rating. 
The reviewer has always experienced a feeling of 
greater confidence about ratings when the subject 
was seen even briefly after the usual rating by means 
of an informer, and when a few items of the scale 
have been spot checked. 

Usually, the publication of basic data about a clini- 
cal instrument serves to clarify issues and strengthen 
the use of the instrument. This is not the case with 
the Vineland Social Maturity Scale. That the scale is 
a very useful clinical adjunct with many different 
types of subjects is attested to by the almost uni- 
formly favorable literature over the past twenty 
years. This reviewer has used it for many years and 
has tremendous respect for it. But this respect is not 
increased by careful reading of the new manual. The 
small number of cases at each level has been men- 
tioned and indeed was known from the earlier 
manual, but the elaborate statistical and graphic 
treatment of each item is not convincing, since, in 
spite of the fact that continued appraisal of each 
item was made statistically, experimentally, and 
clinically, not one of the 117 items has been shifted 
up or down, nor have items been replaced or modi- 
fied. Equally questionable is the upper end of the 
scale, intended for normal and superior adolescents 
and adults. Although based on normative data, this 
part of the scale could well be an extrapolation of 
the lower end of the scale, and it hardly makes 
sense. An ingenious method of measuring defects in 
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social maturation of children and adults, and indi- 
cating the level of social development of children 
who are not necessarily retarded in development, 
the Vineland Scale is artificial when applied to nor- 
mal or superior adolescents or adults. Most users of 
the scale simply ignore the upper levels. 

The Vineland Social Maturity Scale has its great- 
est usefulness in distinguishing between real and 
pseudo-feeblemindedness, in providing a measure of 
social development in studies of child growth and 
development, and in appraising levels of social func- 
tion among handicapped individuals of many kinds 
and in many types of pathology, particularly in 
cases of arrested development or deterioration. It 
has made a permanent place for itself in clinic and 
laboratory, and newer approaches to the appraisal 
of social maturity have not succeeded in replacing 
it. The fact that it has not been found necessary to 
modify or “improve” the scale over these many 
years can be accepted as evidence of the carefulness 
with which it was originally constructed, and the 
keen clinical insight that obviously determined the 
original construction and standardization. 


Doll, Edgar A. Vineland Social Maturity Scale. Min- 
neapolis: Educational Test Bureau, 1947. 


Reviewed by 


Edwin S. Shneidman 
VA Neuropsychiatric Hospital, Los Angeles 


The Vineland Social Maturity Scale is not a psy- 
chological test in the usual sense, nor does this re- 
view pretend to be the usual type of test review. 
Most tests are samples of modes of reacting, per- 
ceiving, or adjusting, or are extrapolations from seg- 
ments of behavior. The Vineland Social Maturity 
Scale, on the other hand, attempts to measure be- 
havior directly. Although Doll says that “The scale 
is not a rating scale and scores are not to be based 
on mere opinions” (italics his), nevertheless the scale 
would seem to be in the same tradition and of the 
same genre as the Gesell Development Schedules, the 
Malamud and Sands Psychological Rating Scale, the 
Ferguson-McReynolds Hospital Rating Scale, etc. 
The Vineland Social Maturity Scale is a well known 
(to more than one generation of clinicians) measure 
of the social and adaptational skills, habits, traits, 
coping behaviors, and interpersonal reactions of sub- 
jects between the ages of 0-1 and 25 plus. Doll in- 
dicates that the purpose of his scale is “To provide 
an over-all measure of personal adequacy in terms 
of social maturity.” He further indicates that the 
scale is a “descriptive analysis of maturation in so- 
cial competence, as a standardized schedule of be- 
havior development in terms of its habitual ex- 
pression.” He feels that it has use in establishing a 
standard schedule of normal development, in measur- 
ing deviations which may be significant in such prob- 
lems as delinquency, amentia, adoption of children, 
etc., in measuring improvement following training 
or therapy, etc. 


There are 117 items in the scale, arranged in order 
of average age norms. The items have been chosen 
to report some aspect of the individual's ability to 
look after his own practical needs. Social quotients 
(SQ) “generally comparable to Binet MA and IQ 
can be obtained from the sum of the items scores.” 
The items themselves range from such as “ ‘Crows’; 
laughs” (0-1) to “Advances general welfare” (XXV). 
The items at the upper end of the scale (ages 20 up) 
such as “Uses money providently,” “Contributes to 
social welfare,” “Systematizes his own work,” “In- 
spires confidence,” “Promotes civic progress,” “Ad- 
vances the general welfare,” etc. seem extremely hard 
to evaluate. Nor does Doll’s injunction that “The 
informant does not make the scoring judgment. This 
is done by the examiner after obtaining from the in- 
formant as much detail as practicable regarding the 
behvaioristic facts which reveal the manner and the 
extent of the subject’s actual performance on each 
item” (italics his) really circumvent the essential 
problem of observing and rating items that are es- 
sentially difficult to rate. 

The main purpose of this summary is neither to 
describe the Vineland Social Maturity Scale nor to 
present a survey of the published research on it; it 
is rather to discuss some aspects of its implied ra- 
tionale, and some cautions in its application. Cer- 
tainly the careful use of the scale to distinguish 
those subjects who do deviate from the norms of 
“social maturity,” so that further study of those in- 
dividuals can be made, would be a useful and re- 
warding application of this instrument. However— 
and this is the burden of this review—the Vineland 
Social Maturity Scale lends itself easily to other in- 
terpretations. 

Ordinarily, when one uses an intelligence test in 
order to obtain an IQ, the main purpose of such an 
effort is to ascertain what the IQ is, although the 
implicit assumption may be that the higher the IQ, 
the better. This implicit assumption operates in a 
much more subtle and in a more questionable way 
in the case of the Vineland Social Maturity Scale. 
Specifically, the assumption is that social behavior 
(including social motivation) is desirable as it ap- 
proaches social conformity. Further, the implicit as- 
sumption is that optimal behavior lies in adjustment 
to a standardized society and conformity to the 
“normal.” Without becoming embroiled in the in- 
tricacies of the late Robert Lindner’s concepts of 
rebellion and salutary nonconformity, it still re- 
mains that the Vineland Social Maturity Scale de- 
emphasizes the individual’s fealty to his own image, 
his strength to be himself, his positive (that is crea- 
tive or effective) idiosyncracies, his areas of adapta- 
tion which may bring him pleasure, or the applica- 
tion of his resources (within his total personality 
organization) toward creativity or self-actualization. 
The recent work of Jean MacFarlane at Berkeley 
and Lois Murphy at Sarah Lawrence would seem to 
indicate that the conforming person is not neces- 
sarily the best adjusted person. One is reminded of 
a section of D. B. Klein’s book Mental Hygiene en- 
titled, “In Defense of the Shy Child” (who does not 
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necessarily become schizophrenic) and one wonders 
if a section should be added in defense of the non- 
conforming child (who need not necessarily become 
maladjusted, psychopathic, or feebleminded). 

The point of all the above statements is that the 
uncritical use of the Vineland Social Maturity Scale 
may imply an unwarranted assumption of the merits 
of conformity and standardization and to permit a 
confusion between aberrant or pathological noncon- 
formity (schizophrenia, for example, at one ex- 
treme). To make these assumptions would be to 
gainsay the infinite variety and the rich diversifica- 
tion of talents and gifts of human subjects. 

As a clinical psychologist—and as a father of four 
—I am not sure that I know what the “best” age 
for toilet training, for example, really is. I would 
also ask whether the child has a problem if he isn’t 
toilet trained by “that” age or rather cannot the 
parent create problems by imparting parental and 
societal anxieties and concerns—his problems—to the 
child. 

The major reference for the Vineland Social Ma- 
turity Scale is Doll’s The Measurement of Social 
Competence (Minneapolis: Educational Test Bureau, 
1953). Two shorter references are the Manual of In- 
structions (published by the Educational Test Bu- 
reau, dated 1947) which cites 59 references, and Dr. 
Doll’s chapter (in the second part of Chapter 20) 
on his scale in Weider’s Contributions Toward Medi- 
cal Psychology (New York: Ronald Press, 1953). 


Rotter, Julian B., & Rafferty, Janet E. The Rotter 
Incomplete Sentences Blank (College Form). New 
York: Psychological Corporation, 1950. 


Reviewed by 


Louis Long 
The City College of New York 


The Rotter Incomplete Sentences Blank (ISB) “is 
a semistructured projective technique in which the 
subject is asked to finish a sentence for which the 
first word or words are supplied” (Manual, p. 3). 
Historically, the incomplete sentence method is 
closely related to word association tests, and the 
ISB has a long history of evolution. Sentence forms 
used by Rotter and Willerman, Shor, Hunt, Holz- 
berg, Rohde and Hildreth, and Payne have had an 
effect upon the ISB. As in all projective situations 
it is assumed that the subject in completing the sen- 
tences will reflect something of his own personality 
—his fears, desires, wishes, anxieties. 

The ISB consists of 40 “stems” that are to be 
completed by the subject. The completions are then 
scored by comparing them with responses that have 
been assigned a scale value varying from 0 to 6. 
The total score is an index of maladjustment. 

The ISB may be used in two very different ways: 
(a) as a gross screening instrument; and (5), as a 
clinical tool. It seems desirable to evaluate these two 
uses separately. 
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Evaluation of the ISB as a screening instrument. 
Biserial coefficients of 0.50 and 0.62 between scores 
and classifications of adjusted and maladjusted stu- 
dents are presented in the manual to show how effec- 
tive the test is when used as a screening instrument. 
The small number of cases upon which these data 
are based should, however, warn clinicians to pro- 
ceed with caution in using the ISB for this purpose. 
Each person using the test will do well to study the 
effectiveness of it in his own local situation. This in 
essence means that the ISB must be validated locally 
which will necessarily require setting up criterion 
groups of adjusted and maladjusted individuals. In 
their manual, Rotter and Rafferty describe in detail 
the method they used to establish their criterion 
groups. It is easy to sense their dissatisfaction with 
the use of ratings of adjustment or maladjustment 
as supplied by instructors teaching classes in effec- 
tive study and in mental hygiene, or by advanced 
student clinicians. In the reviewer’s opinion, the diffi- 
culty of establishing satisfactory criterion groups 
will inevitably make it impractical for many colleges 
to use the ISB. If, however, the aggressive research 
program which seems to be underway at present is 
continued it may eventually be possible for colleges 
to use the ISB as an aid in screening students for 
possible referral to a counseling agency. 

Evaluation of ISB as a clinical tool. To facilitate 
the use of the ISB as a clinical instrument the au- 
thors of the Manual have explained their scoring 
procedure in detail, have given many examples of 
the various kinds of responses, have presented prac- 
tice cases along with the correct scoring of the re- 
sponses, and have given a clinical interpretation of 
six records. Each of the six cases begins with a brief 
background sketch of the individual, followed by a 
record of his responses on the ISB, and an analysis 
of the case in terms of familial attitudes, social and 
sexual attitudes, general attitudes, and character 
traits. The reports conclude with a summary, and a 
discussion of the contribution to the ISB. These 
cases are presented to illustrate the potential value 
ot the ISB and it is emphasized by the authors that 
only some of the possible interpretations are in- 
cluded in the analysis. As one reads the analyses and 
studies the responses other interpretations often seem 
plausible. This in a sense is an indication of the 
value of the test since one of the main purposes of 
a projective situation is to furnish leads that may 
be explored in an effort to gain clinical insights. 
The authors are effective in showing how the ISB 
adds to their understanding of clients. It would be 
interesting to know how the ISB’s contribution com- 
pares with that of the Rorschach or the TAT. Spe- 
cific information about the relative contribution of 
the several projective devices is urgently needed. 

The authors of the manual are careful to point out 
the limitations of the ISB and to remind readers that 
all data presented relate to the college form. It is 
somewhat unfortunate that high school and adult 
forms are listed as being available. Until such time 
as manuals and norms are available it would seem 
desirable to keep such listings out of test catalogues. 
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